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STATEMENT OF FOCUS 



The Wisconsin Research and Development Center for Cognitive Learning 
focuses on contributing to a better understanding of cognitive learning by 
children and youth and to the improvement of related educational practices. 

The strategy for research and development is comprehensive. |It includes 
basic research to generate new knowledge about the conditions and processes 
of learning and about the processes of instruction, and the subsequent devel- 
opment of research-based instructional materials, many of which are designed 
for use by teachers and others for use by students. These materials are tested 
and refined in school settings. Throughout these operations behavioral scien^- 
tists, curriculum experts, academic scholars, and school people interact, in- 
suring that the results of Center activities are based soundly on knowledge of 
subject matter and cognitive learning and that they are applied to the improve- 
ment of educational practice. 

This Technical Report is from the Project on Basic Pre-Reading Skills; 
Identification and Improvement in Program 1. General objectives of the 
Program are to generate new knowledge about concept learning and cogni- 
tive skills, to synthesize existing knowledge, and to develop educational 
materials suggested by the prior activities. Contributing to these Program 
objectives, this project's basic goal is to determine the processes by which 
children aged four to seven learn to read, and to identify the specific reasons 
why many children fail to acquire this ability. Later studies will be con- 
ducted to find experimental techniques and tests for optimizing the acquisi- 
tion of skills needed for learning to read. 
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ABSTRACT 



This report summarizes 1968-69 studies of component pre-reading skills 
of kindergartners » Working from the assumption that independent component 
skills might exist, four' skill areas were selected for investigation on the 
basis of literature findings and a decoding model of reading: visual, acoustic- 
phonetic, letter-sound association, and vocabulary skills. A research strategy 
is presented for the development and validation of assessment tests in each 
area and the subsequent development and validation of training procedures. 

The research reported here fs’ concerned with skill assessment in each of the 
areas . 

The four skill areas were investigated through the use of experimental 
Basic Skill Test batteries administered individually to kind erg artnors from a 
lower-middle class population in Madison (N = .21) and a lower class popula- 
tion in Beloit (N = 22) , Madison Sg participated in three 30- to 50-minute 
sessions in November, December, and January; Beloit ^s in one 45-minute 
session in March, 

Methods, results, and discussion are presented in detail for each of the 
four skill areas and invalid test procedures identified (e.g,, same-different 
tests of rhyming) , In the remaining tests , errors are examined for sources of 
difficulty (e.g., left-right letter reversals) and the pattern of intertest cor- 
relations examined. The findings support an assumption of independent skills 
related to reading. 
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INTRODUCTION 



For the past 3 years, the authors have 
been engaged in the study of cognitive skills 
related to the acquisition of reading . The 
primary goal of the research has been in- 
creased knowledge about the early stages in 
the development of the reading process, fo- 
cusing on those aspects of early reading 
which involve decoding or translating from 
the written to the spoken form of the language, 
as opposed to what is commonly referred to 
as “comprehension." 

The plural, cognitive skills « is appropriate 
for two reasons. First, reading is not a uni- 
tary ability but a collection of abilities which 
develop rapidly during the primary grades. 

For example, the first grade child, slowly 
translating strings of abstract visual symbols 
into th- familiar spoken language, is almost 
certainly operating in a different manner than 
a college student scanning through a text. 
Second, each of these abilities requires .of the 
child a variety of prerequisite cognitive skills . 
As will become obvious, from our point of 
view cognitive skills are very specifically 
defined: recognizing that two visual forms 
are identical, locating a test form in an array, 
or detecting whether two spoken words rhyme. 
For other investigators , cognition is virtually 
synonomous with theory or knowledge; our 
thinking about cognition has been closer to 
the approach which has come to be called 
“human information pro ces sing.’' 

A second aim of our research project is to 
•use knowledge about tha reading process 
gained through basic research on component 
cognitive skills to direct the development of 
more effective procedures for teaching read- 
ing than are now available. It should be 
stressed that we are not looking for a new 
and better way of teaching^ 2 §r se. The sub- 
stantial body of research on the relative ef- 
ficiency of various methods of reading instruc- 
tion has not produced many exciting results 
(e.g., Chall, 1967; Bond & Dykstra, 1967). 
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It still takes too long for most children to 
achieve a reasonable level of competence in 
reading, and there are still far too many school 
children who are functionally illiterate. To 
repeat the point made by Eleanor Gibson (1965) i 
"good pedagogy is based on a deep understand- 
ing of the discipline to be taught and the nature 
of the learning process involved (p. 1072)." 

This paper will not concern itself with the 
second aim above of improving reading instruc- 
tion, because we have yet to grasp that "deep 
understanding." For those who like to specu- 
late about how reading ought to be taught, 
there is adequate stimulation in the technical 
and popular literature. Instead, this Technical 
Report will focus on the methodological base 
from which we are currently working . 

Attacking a problem like "how does a child 
learn to read?" is an interesting challenge to 
those of us with a bias toward reductionism . 

Even when the problem is limited to the decod- 
ing stage, it is obvious that there are a multi- 
tude of task demands — visual and auditory 
perception, attention, learning, inductive 
reasoning, and so on — any of which might 
easily absorb one for a lifetime of investi- 
gation. How dees a person carry out research 
that is at both extensive and intensive? There 
are compelling social reasons for trying to lo- 
cate the most significant "hang -ups" in read- 
ing instruction before another generation passes. 
Yet it is all too easy to deal superficially 
with these problems. Scanning the literature 
on reading, one becomes convinced of two 
things; (a) almost any aspect of a child’s 
intellectual performance correlates to some 
extent with reading achievement and (b) al- 
most any remedial effort will be of some help 
to the child in learning to read. In fact, the 
most relevant variable in determining the suc- 
cess of any remedial program would appear to 
be time — the amount of time a teacher spends 
in one-to-one contact with an individual child 
(Monroe, 1932, p. 150). 

fiOOlO 
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Our research decisions have been guided 
by certain decisions about the innovative po- 
tential of various areas. First, we have de- 
cided to concentrate on the initial phases of 
reading instruction, including reading readi- 
ness. Accordingly, we are most interested in 
the performance of children between 4 and 7 
years of age. Second, we have decided that 
those children with the poorest prognosis rep- 
resent the greatest opportunity for improvement, 
although they also pose the greatest challenges 
in research. 

To those readers who are experimental psy- 
chologists, these introductory remarks may 
strike a strange note. Not since Tinker's 
(195 8) paper on eye-movements in reading has 
there been an article in either the Psychological 



Bulletin or Psychological Review concerned 
with reading. Just as B. F. Skinner bemoaned 
the "flight from the laboratory, " so there might 
be cause lor concern about the absence of 
learning psychologists from the classroom. 

This situation has begun to change lately; for 
example, reading research programs have been 
established in the last few years under the 
supervision of "hardheaded" experimentalists 
such as Eleanor Gibson, Harry Levin, Richard 
Atkinson, Harry Silberman, and Robert Glaser, 
to mention a few. In addition, current research 
in human information processing, psycholinguis- 
tics, and developmental cognitive psychology 
has had considerable influence on our work on 
reading as will be pointed out in connection 
with specific studies in the following chapter. 



II 

TESTS OF BASIC SKILLS; AN ASSUMPTION AND A STRATEGY 



The Basic Skills Test (BST) Package de- 
scribed below was designed as a research 
vehicle which would facilitate exploratory 
research. The test package has potential as 
a diagnostic tool, and could be expanded into 
an instructional program. At present, it serves 
as a matrix by which miniature experiments are 
conducted in areas of relevance to beginning 
reading instruction. We have thought it pre- 
mature to concern ourselves with formal as- 
pects of test construction and evaluation until 
more has been learned about the significance 
of specific component skills and the adequacy 
of alternative methods of assessment. 

The principal assumption guiding this re- 
search is the existence of separable and inde- 
pendent performance skills which are prerequi- 
site to the acquisition of literacy. The assump- 
tion of specific and independent skills is 
implicit in much that has been written about 
reading, although the implications of the 
assumption have rarely been realized in prac- 
tice. Stern's (1968) report on the UCLA Pre- 
school Language Project is a recent example 
of this approach, or one can go back four 
decades to Monroe (1932) who conducted simi- 
lar research most competently. 

An opposing point of view might propose 
requisite abilities of a more global or matura- 
tional nature, as implied by concepts such as 
"reading readiness" or "critical age." Under 
the assumption of specific skills, one is led 
to try to identify specific deficiencies which 
are then the focus of remedial procedures , 
while the assumption of a global factor would 
suggest the adoption of testing ajd training 
procedures of an amorphous type. This con- 



A more or less equivalent form of the global 
hypothesis states that reading performance 
depends on a single primary factor such as 
visual perception, auditory-visual coordina- 
tion, "language development, " etc. 
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trast can be seen in the comparison between 
the highly structured preschool program of 
Bereiter and Engelmann (1966) and enrichment 
or language-experience programs such as those 
of Alpern (1966) or Brottman (1968). 

Starting from the assumption of sparable 
component skills, we have adopted the following 
strategy in identification and investigation of 
specific skills. To start, the skill area is 
delimited as precisely as possible. Then sub- 
tests are designed which "surround" the skill, 
the aim being to select convergent subtests 
which sample the skill in a variety of different 
contexts (Haber, 1969). For example, in tests 
of visual matching ability described below, 
several variations in task format and materials 
were employed. Single-letter matching tests 
were administered in simultaneous and suc- 
cessive modes (i.e. , the letter to be matched 
was either present or absent when the test 
letters were shown) , and stimulus materials 
were varied over a wide range, from real toys 
which the child could actually manipulate, to 
groups of letters printed in upper or lower case. 

An attempt is then made to identify the un- 
derlying psychological processes involved in 
the ability. It is at this point that the "how" 
of testing becomes a major concern. The 
clarity of the instructions, the qua!Uy of the 
stimulus materials, the skill of the tester — 
these and similar general factors always con- 
tribute to a greater or lesser extent to a child's 
performance. Ideally, the contribution of these 
general factors should be minimal. 

Our approach to this problem has been to 
establish a basal level of performance by start- 
ing with materials of minimum difficulty for the 
skill being tested. If the error rate is too high 
with these materials, we conclude that the 
testing procedure is at fault and must be im- 
proved; If the basal performance is satisfac- 
tory, the test is systematically complicated 
by variation in materials or task procedure 
until the children begin to make errors. Then 
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we look for "miniature" training procedures whicn 
can be introduced into the testing sequence at 
the appropriate point to help a child over the 
hurdles. 

It . should not be thought that the approach 
is to "try everything and see what works , " 

The original decision to concentrate on the 
decoding process has meant that priority be 
given to investigation of visual and auditory- 
phonetic processes, and of learning spoken 
responses to abstract symbols. Some areas 
(e.g., vocabulary) have been selected for 
study on the basis of our judgment and evalu- 
ation of the work of other investigators. 

In summary, the BST package was designed 
as an in-depth test of a limited set of cognitive 



skills . Although constructed 'in the form of a 
diagnostic test, the package may also be 
viewed as a matrix within which miniature 
experiments were conducted. Many of the 
experiment's were duds, in the sense that the 
basal performance was so poor as to be of no 
use in isolating sources of difficulty. From 
our point of view, such results mean that 
work remains to be done to simplify the test- 
ing procedure. These tests aim primarily at 
uncovering psychological processes, and only 
secondarily at prediction of later achievement. 
Intercorrelations between subtests will occa- 
sionally be reported, mainly to demonstrate 
the extent of independence of the measures. 



BASIC SKILLS TESTS: GENERAL METHOD^ SUBJECT SAMPLES 



The BST package covers five areas of cog- 
nitive functioning: matching of visual forms . 
auditory- phonetic identification , letter -sound 
association , vocabulary knowledge , and gen- 
eral achievement . First the general procedures 
used in administering the test package will be 
described; then the findings within each of 
these areas will be presented and discussed. 

The entire series of subtests is listed in Table 
1. The series was divided into three sets, 
each of 30 to 50 minutes' duration, and given 
at 1 -month intervals between November 1968 
and January 1969 to : kindergartners in Madison, 
Wisconsin. This group of children will be re- 
ferred to as Msn. In March 1969, 10 of the 
subtests in Table 1 were administered in a 
single session to kindergartners from a school 
in Beloit, Wisconsin. This group will be re- 
ferred to as B(^. The Bel group was tested to 
replicate some of the more prominent findings 
from the earlier testing. 

The Msn sample consisted of 21 subjects, 

11 boys and 10 girls, from an afternoon kinder- 
garten class in a predominantly lower -middle 
class area of the city. The mean age of the 
21 Ss in November was 64 months, ranging 
from 59 to 70 months. The Metropolitan Readi- 
ness Test was administered in March 1969. 

The Bel sample consisted of 22 kindergartners 
from a predominantly lower-class area. Eleven 
kindergartners were drawn at random from the 
morning session and 11 from the afternoon ses- 
sion. The sample included 10 boys and 12 
girls, with a mean age in March of 69 months, 
ranging from 63 to 75.'months.2 The Metropolitan 
Readiness Test was administered to the class 
in May, 1969. 



2 

Five of the 22 children in the sample were 
Negro, reflecting the proportion of Negro chil- 
dren in the school. The data from these chil- 
dren have not been analyzed separately. 



Test variety and total test time were con- 
sidered in arranging tests for the three differ- 
ent sessions administered to the Msn group 
and in the selection and arrangement of tests 
for the single Bel session# Within each ses- 
sion, easy and difficult tasks were alternated 
insofar as possible to sustain attention. The 
experimenters were male and female graduate 
and senior undergraduates at the University of 
Wisconsin, Madison • Detailed scripts were 
prepared for each of the subtests, and experi- 
menters rehearsed the scripts before giving 
any of the tests to insure the equivalence of 
procedures and instructions. Each experimenter 
tested at least two children for practice. The 
data from these practice subjects were not in- 
cluded in the results reported below. ^ / 

Two forms of each subtest were constructed. 
In some instances, the alternate forms actually 
comprised different experimental treatments . 
Children were randomly assigned to form A or 
3 independently for each session with the con- 
straint that approximately equal numbers of 
subjects were tested on each form. Subjects 
were tested individually in small rooms at each 
school. Responses were recorded as unobtru- 
sively as possible. Tests involving spoken 
responses were recorded with a Uher 5000 tape 
recorder and a Shure 545L lavalier microphone.^ 
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A few remarks might be made about testing 
young children over an extended period of time 
First, instructions are always crucial, but es- 
pecially when success in a test depends upon 
comprehension of key words or expressions 
such as "rhyme" or "sound the same." The 
tester must then rely on indirect tests, exam- 
ples, gestures, or pre-training procedures for 

4 

Phonemes will be indicated in slashes / / 
; by the International Phonetic Alphabet. 



Table 1 

Basic Skills Test Package, Order of Tests, and Time per Test 





Session/Session Order 


Approximate 
Time (Min,) 




Test Name 


Msn 


Bel 


Notes 


Visual Matching: 










Matching 


I-l 


1 


5 




Matching-Retest 


1-9 


8 


4-6 


One group tested after VE 
training, one group after 
Repr, Training 


Oddity-Selectiori 


1-5 




3 




Memory-Matching 


III-2 




10 




Alphabet Knowledge: 










Production 


1-2 


2 


1 




Recognition 


1-3 


3 


2 




Acoustic-Phonetic: 










Rhyming 


1-6 




6 


'Same-different 


Rhyming -Retest 


II-6 




4 


Same-different 


Initial-Sounds 


I-IO 




6 


Same-different 


Rhyme-Production 


II-5 


5 


3 




Segmentation 


III-3 


9 


10 




Letter-Sound Association: 










Context-Learning 


I-ll 




7 




Letter-Word 


II-4 




10 




Alphabet-Learning 


III-l 




15 




Vocabulary Knowledge: 










Picture-Naming 


1-4 


4 


8 




Line-Drawing-Naming 


II-l 


7 


6 




Sorting 


II-2 




10 




General Abilities: 










Word-Memory Span 


1-7 


6 


2 




Simple Directions 


1-8 

II-3 




3 


"Simon-Says" game 



Note — Msn sample was tested over three sessions, I, II, and III in November, December, 
and January, respectively; Bel sample was tested in a single session in March, 




communication. Second, for a test session of 
half an hour or more, keeping the child's atten- 
tion is a serious problem. In our experience. 



attention is best maintained if verbal instruc- 
tions are kept to a. minimum. Constant changes 
in the format of the tests helps, as does alter- 
nation of easy and difficult problems. 



HrtOOlS 
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IV 

VISUAL TASKS 



To read English a child must learn to iso- 
late, differentiate, and identify the letters of 
the alphabet. He must learn both to identify 
these symbols in spite of variations in type 
styles and size, and to disregard structural 
variations such as upper- and lower-case forms. 
He must learn to work with ordered sequences 
of these symbols, noting the elements in such 
sequences, and treating as equivalent only 
sequences in which both elements and order 
are the same. 



METHOD 

The visual tasks in the BST package were 
designed to evaluate relatively , low-level skills 
in this area, using multiple-choice or recogni- 
tion responses; identification, labeling,, and 
reproduction performance were not measured. 
Task, instruction, training,* and test materials 
were varied over the several subtests. 

On Matching and Matchinq-Retest > the child 
was shown a standard stimulus together with a 
test set of four or five alternatives, and was 
asked to pick the single alternative that matched 
the standard. In Memory-Matching . the stand- 
ard was presented alone, removed, and then the 
test set was presented 2 seconds later. The 
child thus had to rely on his memory of the 
standard in making the match. Finally, in 
Oddity-Selection the child had to point out the 
odd member in a set of four alternatives . 

Materials were varied in the following ways . 
To determine the difficulty of the matching task 
itself, small, three-dimensional toy objects 
were used as the first few items of Matching . 
The remaining items were printed in capital 
and lower-case letters, first single letters and 
then letter groups— pairs , triples, and quad- 
ruples. The items were printed on strips of 
tagboard in a horizontal row of letters . 5 in . 
high, with the standard on the left. The test 
sets always included one or more alternatives 



designed to be highly distracting. Thus single- 
letter and letter- pair standards v/ere matched 
with visually confusing alternatives (e.g., b 
and ^ . For letter-pair standards , the test 
always contained an order reversal (e.g,, 
for gp) . For groups of three or four letters, the 
test set consisted solely of permutations of 
the letters in the standard (e.g., rmn, mm, 
nmr) . The odd member in Oddity-Selection was 
either visually similar (single letters), or a 
reversal (letter pairs), or was formed by a 
permutation of the letter sequence or replace- 
ment of a single-letter in the sequence by a 
visually similar letter (three or four letters) . 

An example would be VWV, WW, VWV, VWV . 

Preliminary analyses of matching data from 
pilot studies had revealed that most errors in- 
volved a failure to preserve order (e.g., if SZ_ 
was the standard, ^ was as likely to be se- 
lected as . Accordingly, prior to the 
Matchinq-Retest . children were given one of 
two types of supplementary instructions or pre- 
training. Either the importance of order was 
verbally elaborated upon using a single ex- 
ample (V. E.), or the child received a series 
of seven items in which he arranged individual 
letters to match a standard, with feedback on 
the correctness of the reproduction (Repr.). 



RESULTS 
Single-Letter Items 

Performance on the various subtests is sum- 
marized in Table 2. 

The effects of variation in material are quite 
apparent, Only the Msn sample was tested on 
the toy or "object" materials. On the first two 
Items, the set of alternatives consisted of four 
very different toys. There were 2 errors out of 
42 possible, an error rate of less than 5%. On 
the third "object" item, the standard was a 
safety pin and the alternatives were three safety 
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Table 2 

Percentage Correct Responses on Visual Matching, 
Related Subtests of BST, and 
Metropolitan Readiness Matching Subtest 



Test/ Materials 


Msn 


Bel 


Mean 


S.D, 


Mean 


S.D. 


Matching; 










Objects (3)^ 


89 


19 


— 


— 


Single Letters (10) 


83 


12 


87 


11 


Double Letters (5) 


36 


36 


48 


28 


3-4 Letters (7) 


30 


16 


35 


54 


Match.lng-Retest; 










Double Letters (3) 


67 


26 


56 


39 


3-4 Letters (6) 


40 


28 


33 


71 


Memory-Matching: 










Single Letters (4) 


93 


24 


— 


— 


Double Letters (5) 


42 


20 


— 


— 


3-4 Letters (7) 


42 


25' 


— 


— 


O dd ity- S e le c tio n : 










Single Letters (2) 


86 


23 


— 


— 


Double Letters (4) 


29 


25 


— 


— 


3-4 Letters (6) 


29 


23 


— 


— 


Alphabet-Production: 


27 


27 


33 


31 


Alphabet -Re cog nit ion: 


23 


77 


18 


82 


Metropolitan Readiness: 










Matching (14) 


69 


19 


49 


25 



^Number in parentheses is npmber of different items included in each category. Guessing 
rate is 25% except for Mphabet Tests (4%), Metropolitan (33%), arid 8 of the 10 single- 
letter matching items (20%) , 



pins of varying size and- color, and a red 
whistle. Increai'.ing the similarity of the test 
set increased^the error rate to 25%, indicating 
the importance of the make-up of the test set, 

• On the other hand, analysis of the single- 
letter items indicates that for our samples of 
children, the letters in the English alphabet 
were not perceived as confusing, though they 




had been selected as such. Except for the 
first two items, the single-letter series were 
constructed to be quite difficult, since pilot 
testing during the summer of 1968 had pro- 
duced extremely low error rates on single let- 
ters, Typical materials were G - C Q, G D 
* Q » k • k k k kill snd i. - n m w ^u., “ Diffi- 
culty*' was based in part on feature .analysis 



of the upper- and lower-case letters (Gibson, 
1965), and in part on findings that right -left 
and up-down reversals often produce confusions , 
The right-left mirror image transformation 
was the major source of errors. Three single- 
letter test sets included a right-left reversal 
of the standard (e.g., b d) , The error rates 
on these items were 46% (Msn ) and 41% (Bel) , 
with selection of the reversal alternative ac- 
counting for 85% of these errors . For the re- 
maining seven items in the single-letter series 
there were two errors in both the Msn and 
samples out of 147 and 154 opportunities re- 
spectively. Dunn-Rankin (1968) asked second 
and third graders fto judge the similarity of 
lower-case letters; for the two sample sets 
above (b and r_ standards) , the perceived simi- 
larity of the test sets is about equal from Dunn- 
Rankin' s ratings, but the error rate for the b 
item was 40%, whereas thexiteni was correctly 
matched 100% of the time. 



Letter Groups 

The error rate increased sharply when letter 
groups were tested, particularly triples and 
quadruples where performance was nearly at 
chance performance level of 75%. For bigrams, 
one of the alternatives was an order reversal 
of the standard pair and tv/o were visually con- 
fusable with the standard. Thus, a typical 
item might be CQ — OQ QC CQ CO. Of the 
bigram errors on Matching, 70% (Msn) and 65% 
( Bel) were choices of the reverse order alterna- 
tive; if the errors had been the result of random 
selections among the alternatives, only 33% of 
the errors should have been reversal errors. 

In the Matching -Retest . 5 2% (Msn ) and 43% 
(Bel ) of the bigram errors were reversal errors. 
Unfortunately, the placement of reversal alter- 
natives was not balanced for serial position 
between the A and B groups, and so a break- 
down of reversal errors by treatment groups is 
meaningless. On Memory-Matching , 54% 

(Msn ) of the bigram errors were reversal errors. 



Serial Positions 



were constructed of the same letters as the 
standard. If letter-order information were not 
preserved by the child, and if the child scanned 
the alternatives from left to right, one would 
predict the distribution of errors would resemble 
the data in Table 3. The presence of the 
standard to the left of the alternatives was 
important in directing the scanning process, 
since errors were randomly distributed when 
the standard was not present. 

The data on the right of Table 3 provide 
further information on the scanning process. 

The percentage of reversal errors in bigram 
matching are shown as a function of whether 
the correct alternative was to the left of the 
reversal alternative (C-R) , or to the right 
(R-C) . These data are shown for each of the 
matching tasks , separately for the V. E. 

(verbal elaboration) and Repr. (reproduction) 
groups. In Matching , if the reversal alterna- 
tive were to the left of the standard in the 
test set, it was quite likely to be chosen. 

This result also held in Matchinq-Retest fol- 
lowing V. E. but much less so following Repr, 
Thus, the training procedure affected the type 
of errors made by the children, though not the 
rate of errors . In Memory-Matching , the re- 
versal alternative was equally likely to be 
selected v/hether to the right or left of the 
correct alternative, supporting the conclusion 
that scanning was unsystematic in this task. 



Oddity-Selection 

Overall performance level on the Oddity- 
Selection task, in which the child was asked 
to point out "which one of these is not like 
the others" was comparable to performance of 
the other matching tasks, which would suggest 
that these tasks all tapped similar component 
skills. On the other hand, one child insisted 
that for eight of the twelve items, all four 
alternatives were identical; in seven other 
instances a child refused to point out any of 
the alternatives as different from the others . 

In the other matching tasks, there were only 
two cases of this sort. There was no evidence 
of serial position effects in this task. 



The items were displayed* in a horizontal 
array with the standard to the left. There were 
enough errors in the letter -group series for 
analysis of serial position effects, which 
showed (Table 3) that in the Matching and 
Matchinq-Retest tasks, the children were scan- 
ning from left to right , In the Memory-Matching 
task, no pattern to the errors was evident. In 
the letter-group items, all of the alternatives 
were highly similar, in the sense that they 



Metropolitan Readiness Match 

Performance on the Matching subtest of the 
Metropolitan Readiness Tests (Hildreth, 
Griffiths, & McGauvran, 1964) was somewhat 
higher than on the BST matching tests. This 
subtest consists of fourteen items, with the 
same format as M atching in BST (horizontal 
array, standard to the left) , A variety of 
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Table 3 



Breakdown of Total Errors , Showing 
Position Effects in Visual Matching Tests 



Test/School 


Per Cent of Total Errors 
at Each Serial Position 


Per Cent Bigram 
Reversals^ 


1 


2 


3 


4 


Training 

Group 


C-R 


R-C 


Matching: 
















Msn 


63 


21 


8 


8 


V.E. 


15 


67 












Repr . 


14 


76 


Bel 


52 


23 


12 


13 


V.E. 


14 


52 












Repr. 


14 


40 


Match inq-Retest : 
















Msn 


78 


10 


6 


5 


V.E. 


15 


70 












Repr. 


14 


27 


Bel 


50 


27 


7 


16 


V.E. 


14 


55 












Repr. 


5 


18 


Memory-Matchinq: 
















Msn 


21 


27 


23 


29 


— 


29 


33 



Per cent of reversal errors on two -letter stimuli when correct choice was to left (C-R) or 
right (R-C) of reversal choice, by V,E.-Repro. training groups; training given just prior to 
Matchinq-Retest . 
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standard items is used in this test — words 
from four to nine letters long in upper- and 
lower-case, as well as various kinds of ab- 
stract figures. Each test set contains three 
alternatives which would produce a higher 
guessing rate than for the BST matching tests. 
The alternatives are similar to the standard, 
consisting of letter permutations for the words 
and minor figural variations for the forms . 

Also in BST Matching , the leftmost alternative 
in the test set was correct for only two of the 
twelve letter groups, whereas in the Metro- 
politan Matching , the leftmost alternative is 
correct for five of fourteen items . Given the 
previously noted bias for the leftmost alterna- 
tive, these arrangements would also produce 
fewer errors on the Metropolitan compared to 
the BST. Finally, the BST was administered 
in the fall, the Metropolitan the following 
spring. Hence, no significance is attached 
to the higher performance on the Metropolitan 
Matching test. Comparisons between perform*^ 
ance on these various tests will be discussed 
in a later .section on intertest correlations. 
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DISCUSSION 

There are three general considerations in 
the relation of visual perception to reading. 

First is the question of isolation , discrimina- 
tion , and identification of the abstract charac - 
ters which serve as letters in the English alpha- 
bet. Second is confusion of left -right mirror 
images , which is related to the first question 
but warrants discussion as a separate issue,. 
Third is the perception , analysis , and storage 
of written words . 



Letter Perception 

Most children in this country encounter 
printed materials at an early age . Few of the 
kindergartners we tested "knew their ABC's" 
but most attempted the letter- matching task 
without hesitation and performed quite well, 
given the confusability of the English alphabet 
and the fact that they could not label many 
letters. Whether matching is conceived as a 




process involving templates or distinctive- 
features {Neisser, 1967), the single-letter test 
sets were highly similar, and are judged as 
such by second graders (Dunn -Rankin, 1968), 
and yet error rates were quite low except for 
right-left reversals . 

Gibson, Gibson, Pick, and Osser (1962), 
in what has become a classic study, investi- 
gated the degree to which letter-like standards 
were confused with various transformations of 
the standard — perspective, rotation and reversal, 
line to curve, and break-and-close. The format 
of their test was generally the same as that 
used in BST. There were 12 alternatives in 
each test set, and the children were instructed 
to select as many alternatives as they wished 
from the set, since sometimes more than one 
copy of the standard would be included. 

From the ages of 4 to 6, both right-left and 
up-down reversals produced errors (choice of 
a transform as equivalent to standard) at a rate 
of from 50% to 15%. Kinder gartners were also 
tested on transformations of real English letters 
and here the error rate for right- left reversals 
(b, ^ was more than double that of up-down 
reversals (b, £) — 19% vs. 8.5%. 

Any attempt to identify transformational re- 
lations among real letters is risky, but as an 
example, if it is assumed that C and O are re- 
lated by a break-and-close transformation, or 
V and U by a line- to -curve, then the error rates 
in Gibson, et al . . would lead one to expect 
much higher error rates in BST matching tests 
than were actually found. In both of our sam- 
ples, the overwhelming preponderance of errors 
made by the children on single-letter tests 
were right-left reversals; only 10 out of 70 
errors could be classified as up-down reversal, 
close-break, or line -to- curve. 

The substantial similarities among English 
letters are indicated by the difficulty in devis- 
ing reliable pattern -recognition devices (Uhr, 
1966). The Gibson, et al . . investigation sug- 
gests that transformational relations may be a 
useful way of describing perceptual similarity. 
What remains to be formulated is a model of 
stimulus structure which can complement fea- 
ture-transformation analyses . Gibson's (1965) 
approach took the form of a "feature listing," 
in which, for example, Z was equal to <straight 
segment, horizontal, oblique/; discontinuity, 
horizontal>. This listing provides all the ele- 
ments or features in Z, but lacks a description 
of how the elements are structurally related. 

A more complete list would include features and 
rules of combination. Thus £ might be roughly 
described as <straight segment, vertical, de- 
scender; curve, closed> . This list consists 
of writing instructions which are applied from 
left to right, with semicolons denoting opera- 
tional breaks . 

o 



The point being made is just this: the per- 
ceptual similarity of two letters depends not 
only on the transformational relations between 
the objects but also on the structure of the 
objects . We know that d and b are easily con- 
fused by kindergartners , \and we have to live 
with the fact that both are letters in English — 
but what about S, and 8 , 5_ and 6 , or N and M ? 

Is it true in general that the right-left transfor- 
mation is a source of confusion, or does this 
hold only for certain structural classes of 
stimuli? An answer to this question might 
lead to a better understanding of the nature of 
b-d and £-£ errors . 

The matching task has much in common with 
visual search tasks of the sort investigated by 
Neisser (1967), in which subjects are presented 
one or more items to remember, and are then 
asked to look through a list until they find one 
of the memorized items . Instructions stress 
speed — the target is to be reported as quickly 
as possible. Neisser has shown that increasing 
visual similarity slows down search rates in 
college students, and Gibson and Yonas (1966) 
have reported similar results using subjects as 
young as second graders. 

The task of reading itself also involves 
heavy memory and visual loads; and although 
speed may not be overtly emphasized, the lag- 
gard is certainly penalized. Theru is little 
research on the effects of item complexity, 
memory load, visual load, visual similarity, 
and stress of speed V£. accuracy on matching 
performance. These variables denote, in turn: 
the choice of letters, words, familiar or non- 
sense forms as item matei A; the number of 
alternatives being searched through; the simi- 
larity (transformational or featural) of the al- 
ternatives in the test set; and the degree to 
which instructions require the subject to re- 
spond as quickly as possible or as accurately 
as possible. The Gibson and Yonas study sug- 
gests that in speeded search, visual similarity 
may affect performance. With similar materials 
but minimal memory and visual load, and mini- 
mal stress on speed, performance was essen- 
tially perfect in the BST matching tests of single 
letters . The difficulty in the more demanding 
search task would appear to be cognitive rather 
than perceptual, except possibly for the prob- 
lem of right-left reversal confusions, to which 
we will now turn our attention. 



Right-Left Reversals 

The fact that children confuse the letter 
pairs b-d and £-£has been repeatedly docu- 
mented (see Fellows, 1968, and Benton, 1959, 
for reviews). The problem is universal, pro- 
found, and persistent. Substantially more than 
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half 6f all kindergarten children confuse these 
pairs sometimes. In a two-choice test, error 
rates of 15-25% are common. The difficulty 
may persist until 9 or 10 years of age in the 
case of children who are "nonreaders." The 
problem has been attributed to various kinds 
of dysfunction — physiological, perceptual, 
memorial, or cognitive. Various training pro- 
cedures have been evaluated, with mixed re- 
sults (Strang, 1967; Harris, 1969; Jeffrey, 
1958). There have been no tests of the long- 
term effects of such training on reading or 
reading-related tasks. The degree of confusion 
appears to depend on the physical arrangement 
of the stimulus objects. For example, Hutten- 
locher (1967a, 1967b) presents evidence that 
right-left reversals occur in a horizontally 
:-^.rranged display, but when a vertical array is 
used, up-down reversals are more frequent than 
right-left reversals . 

If it were not for the existence in the English 
alphabet of the pairs b-d and £-£, this entire 
discussion would be of interest only to investi- 
gators of spatial orientation (Howard and 
Templeton, 19 66). Various efforts have been 
made to determine the number of reading errors 
due to right-left reversals; this is probably a 
hopeless task. It is clear that children do 
make such errors, even under optimal testing 
conditions, and that we do not understand the 
sources of such errors or how to remedy the 
problem. One simple solution might be to use 
only upper-case letters in initial reading in- 
struction, or to adopt some sort of stylistic 
variation in the typography used in initial read- 
ing, (i.e., replace d with 3 and £ with g) . 

Word Matching 

Next we will consider matching of letter 
groups. Although there is considerable evi- 
dence suggesting that experienced readers per- 
ceive words as units rather than as letter 
strings, the manner in which beginning readers 
or prereaders process words is less certain. 
Surprisingly, there does not appear to have 
been a test of the simplest hypothesis, viz., 
that the probability of error in matching a 
standard word with a particular test alternative 
is the product of the error probabilities for the 
constituent letter pairs — assuming independence 
of the letter comparison processes. The prob- 
lem is an interesting one, because there are 
good arguments to suggest that error rates in 
word matching should be greater or less than 
predicted by independent scanning; greater 
because of the increased inform at ion -processing 
load; less, because word configuration cues are 
another source of information. Although there is 



anecdotal evidence to the effect that word con- 
figuration may be important in scanning by ex- 
perienced readers, there appears to be little 
research on the effects of configuration on 
matching by beginners. (Our data do not pro- 
vide a sufficient range of tests of single or 
multiple letter groups for such a test.) 

An important source of word matching errors 
in the BST data and other studies involves con- 
fusions between a standard and an alternative 
consisting of an order permutation of the letters 
in the standard. A special case of this situa- 
tion is the right-left reversal of a word — was 
for saw . Two of the better studies on word 
perception in prereaders come from the older 
literature. Hill (1936) investigated word- 
matching ability in kindergartners and. first 
graders /with a well-constructed test designed 
to evaluate the effectiveness of cues at differ- 
ent locations within a word. She found that 
the first and last letters were more salient than 
those in the middle; that the error rate was 
highest when an alternative preserved the con- 
figuration of the standard; and that the highest 
error rate was observed when two words differed 
by a single mirror-image letter (e.g. , rimd vs. 
rimb ) . Hill did not test mirror-image word 
alternatives , but from her data it is clear that 
other sources of confusion contribute substan- 
tially to word matching errors . • First graders 
made fewer errors than kindergartners , but the 
pattern of errors v/as the same in both groups . 

Davidson (19 34) compared form and word 
reversal confusions in kindergartners and 
first graders . / A five-alternative multiple- 
choice test was employed, the child being 
instructed to select one or more alternatives 
that were the same as the standard. For the 
"nonsense" geometric forms, one of the alter- 
natives was a right-left reversal; for the words, 
one of the alternatives contained the letters of 
the standard in reverse order. Confusions were 
common with both types of materials; 94% of 
the kindergartners / and 62% of the first graders 
made at least one form reversal, and the cor- 
responding figures for words were 83% and 33%. 
The percentage of errors, reversal and others, 
for forms and words, is rather interesting 
(Table 4) . Reversals account for more than 
two -thirds of all form errors in both kinder- 
garten and first grade children, whereas 
"other" errors are in the majority in the 
word tests. [Davidson describes the other 
alternatives simply as "words known to be 
easily confused with the key word."] Thus, 
Davidson's results suggest that order re- 
versal errors are not necessarily the singu- 
lar source of confusion in word perception 
that mirror-image reversals are in single- 
letter matching. 
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Table 4 



Error Distributions in Visual Matching of 
Forms and Words by Kindergarteners and 
First Graders (Davidson, 193fi) 





Forms (10)^ 


Words (15)^ 


Grade 


Reversals 


Other 

Errors 


Total 


Reversals 


other 
■ Errors 


Total 


Kindergarten 
(n = 50) 


23 


11 


34 


22 


26 


■ 48 


First Grade 
(n = 120) 


13 


5 


18 


5 


8 


13 



^Number of items in test. 



Another relevant study is that of Pufall and 
Furth (1966). In one portion of that study, 
children from 4 to 6 years old classified pairs 
of items as same or different. The items con- 
sisted of triads of two objects of one kind (^ 
and a third object of a different (B) (e.g., AAB 
or ABA) . On each trial, the child was presented 
with a pair of triads which had the same or dif- 
ferent sequences (e.g., AAB - AAB or AAB - BAA) . 
Various types of materials were used — colored 
marbles, colored cards, familiar geometric 
forms, line drawings of familiar objects, and 
nonsense forms. Given the high intralist simi- 
larity, error rates were relatively low, ranging 
from 29% at age 4 to 0% at age 6. The result 
most pertinent to order errors was the finding 
that 55% of all errors were mirror-image re- 
versals; i.e., AAB and BAA were identified as 
•’same'' more than half the time. When the 
memory component of the task was increased 
by presenting the two triads successively rather 



than simultaneously, mirror-image reversal 
errors occurred at about the same rate, but 
the rate of other types of errors (e.g. , AAB - 
ABA, "same") doubled. 

In summary, word matching was a relatively 
difficult task for kindergartners , /even though 
they were able to match the component letters. 
Words were handled as strings of more or less 
independent letters, although confusions due 
to configurational similarity also existed. 
Children did not appear to match solely on 
the basis of single-letter identity, although 
initial and final letters were more likely to 
be correctly matched than middle letters. 
Permutation confusions are common and al- 
though there is little systematic research on 
the problem, right-left order reversal con- 
fusions appear to occur frequently in word 
matching but not so exclusively as right- 
left mirror-image confusions in single-letter 
matching. 
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V 

ACOUSTIC-PHONETIC RECOGNITION 



We have argued elsewhere that if a child 
is to learn to read English, almost certainly 
he must learn to make use of letter-sound cor- 
respondences (Venezky, Calfee, & Chapman, 
1969). Learning these correspondences in- 
volves analysis and identification of strings 
of letters and of the phonetic units which cor- 
respond to letters. Most literate adults un- 
questioning ly accept a "discrete-units" theory 
of phonology, probably because they confuse 
letters and sounds. Yet research on this prob- 
lem to be discussed below suggests that pre- 
readers have considerable difficulty performing 
many tasks which require analysis of a spoken 
word into phonetic components . 

The acoustic-phonetic subtests in BST em- 
ployed a variety of procedures to evaluate 
children's competence in this area — recogni- 
tion of word pairs having the same or different 
initiator final phonological segments, produc- 
tion of rhymes, and acquisition and transfer 
tests on a paired-associate problem which 
could be "solved" by noticing that the re- 
sponses were phonological segments of the 
stimuli (e.g., FEEL - EEL) . 

METHOD 

Three of the tests were same-different tasks 
— Rhyming . Rhvming-Retest . and Initial-Sounds . 
In each, the child was informed by instructions 
or pretraining that certain features of word pairs 
would be critical, the final -VC segment in the . 
rhyming tests and the initial segment in 
Initial-Sounds . The child then was asked to 
respond "Yes" or "No" to a series of word 
pairs, depending on whether the critical fea- 
ture was. the same or different in the two words . 

In Rhyming i the child was asked whether or 
not two words "sounded the same at the end"; 
a pretraining series with corrective feedback 



was used to exemplify this concept. The 
Rhyrning-Retest was the same as Rhyming ex- 
cept for materials t but was preceded by the 
Rhyme-Production task to be described shortly. 
In Initial-Sounds the child was asked whether 
or not "two words start with the same sound." 
Three continuants were tested, /s/, /m/, and 
/!/. To simplify the task for the child, all 
tests of a given phoneme were administered 
in a block . Two different testing procedures 
were used. Children in Group A received two 
exemplars of the critical phoneme at the be- 
ginning of each series. Then a series of test 
items was presented and the child was asked 
whether or not each one began with the critical 
sound. Children in Group B were also required 
to repeat one of the exemplars together with 
the target word before the test question was 
asked. 

The Rhvme-Productibn test, designed as 
another means of testing rhyming ability, gon- 
sisted'pf two parts. First, the experimenter 
gave two rhymes for each of eight words spoken 
by the child in a picture-naming task. Thus, 
if the child said "girl" when shown a picture, 
the experimenter might say "pearl, curl.'^ Then 
the child was asked to switch roles, to- give 
a rhyme for each word pronounced by the ex- 
perimenter as the name of a picture. The ex- 
perimenter told the child he was right if the 
child produced a rhyme, and gave two rhymes 
if the child was wrong or gave -no response. 

The Segmentation test consisted of two 
paired-associate tasks,- eagh task eonsistiag^ 
of five study-test trials on a list of three pairs 
followed by a transfer test on six different but 
related stimuli. The stimuli were all familiar 
CVC words, and the responses were the final 
- VC segment of each word. For one list (RW), 
the responses both in training and transfer 
were real words; [F( SH(OUT). P(ILE)], 
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and for the other list (NS) the responses were 
nonsense: [S(OAP). RqPE). ,CH(I^)].5 Chil- 
dren in Group A received the NS list first fol- 
lowed by the RW list. Group B received the 
lists in the reverse order. The children were 
urged to give a response on every test, even 
if it was a guess . 



RESULTS 

The Rhyming . Rhyming-Retest and Initial- 
Sounds test results are quickly described. In 
each test, the percentage of correct responses 
was 49%, compared with a guessing rate of 50%. 
In short, none of the children did better than 
chance on any of the tests . Some variation in 
performance was apparent in response biases 
but it is clear that the children tested could 
not perform any of these tasks — because of an 
inability to grasp the concepts of phonetic 
matching, because the instructions or pretrain- 
ing were inadequate or other short-comings of 
the "same-different" procedure, or any combina- 
tion of these reasons. 

The Rhyming -Production test produced the 
most interesting results . The percentage of 
rhymes produced were 39% (Msn) / and 37% (Bel). 



The data do not provide an especially 
strong test of the role of response familiarity, 
because the responses in the RW list, (- OUT , 
-EEL, - ILE or " aisle" ) . were not much more 
familiar to the children (except for - OUT ) than 
the NS responses (- QPE , -lEF, - IDE ) . 



Some children in both groups refused to attempt 
the task at all (5/21 (Msn ) and 4/22 (Bel )); for 
those children who made an attempt, failures 
to respond were 9% (Msn) and 26% (Bel ) . Some 
children were quite good at the task; 10/21 
(Msn ) and 10/22 (Bel) produced four or more 
appropriate rhymes In eight attempts. Except . 
for three children who produced a single correct 
rhyme, the remaining children failed on all 
eight items, producing noticeably bimodal dis- 
tributions In both samples . A substantial per- 
centage of the rhymes were nonsensical, 51% 
(Msn ) and 44% (Bel) , indicating k grasp of the 
phonological nature of the task/ Only a very 
few of the responses were semantic associates 
(e . g . , pot - coffee cup ) . 

In Table 5 are the results of analyses of the 
Segmentation test In which children learned two 
paired-associate lists based on phonological 
relations. between the stimulus and response 
members and were tested for transfer of this 
learning. Analysis of variance was carried 
out on several dependent variables: (a) num- 
ber of correct responses, (b) phonological 
errors, (c) stimulus repetitions, and (d) other 
errors. The independent variables were school, 
group, RW y^. NS, and training ys_. transfer. 
Aside from confirming that performance of one 
of the Bel groups was significantly poorer than 
the other three groups, the most interesting 
result was a significant (e,< .05) interaction 
between RW and NS' materials during training 
and transfer. The details of this interaction 
are displayed in Fig. 1 . During both training 
and transfer, about .6 of the responses were 
either correct or phonological errors. (Phono- 



Table 5 

Percentage of Correct and Phonological Responses on 
Segmentation Test During Training and Transfer 



Trial 


Training 


Transfer 


1 


2 


3 


4 


5 


Real Word: 














Correct 


35 


44 


50 


51 


60 


24 


Phonological Error 


21 


15 


10 


9 


9 


28 


Nonsense: 














Correct 


27 


34 


44 


44 


48 


124 


Phonological Error 


25 


23 


19 


19 


13 


27 
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Fig. 1 Segmentation Responses to Real Word and 
Nonsense Lists on Training and Transfer 



logical errors were defined as monosyllabic 
responses in which the vowel matched that of 
the stimulus word, excluding repetitions of 
the stimulus.) In transfer these two classes 
of responses occurred about equally often. 

In training on RW lists, .8 of^these two re- 
sponses were correct but only .65 of these 
two classes of responses were correct in NS 
lists. In other words, RW and NS lists pro- 
duced equal numbers of phonologically related 
responses in both training and test; RW lists 
produced a larger proportion of exactly "cor- 
rect" responses but this was not significantly 
related to transfer performance. 

There were few misplaced responses (e.g., 
SHOUT - EEL) in either the RW or NS list, ex- 
cept for the response OUT which was given 
incorrectly in the RW list a total of 34 times 
(out of a possible 430 opportunities). There 
were 9 other misplaced responses in RW lists 
and a total of 8 misplaced responses in NS lists. 

To the extent that the children learn a con- 
cept based on the phonological relations be- 



tween the pairs, one might expect to observe 
the following: (a) transfer to a new list of 
stimuli for children who reached criterion on 
the training list, (b) more rapid learning of 
the second list compared to the first, (c) errors 
should be phonologically related to the stimu- 
lus word, and (d) the distribution of correct 
responses should be somewhat bimodal in both 
training and transfer. 

Except for (b) which might be predicted on 
other grounds as well, each of these hypotheses 
found support in the data, indicating that many 
of the children were sensitive to the phonologi- 
cal character of the pairs. Several children 
responded with an initial consonant substitu- 
tion rather than a deletion as required by the 
task. For example, one child produced WINE , 
WEAM , WOSS . and WAFE for the transfer items 
PINE . TEAM, MOSS , and SAFE , respectively. 

The bimodal. characier of the distributions 
of correct responses in training and transfer 
is noteworthy. If a child had grasped the 
phonological nature of the task by the end of 
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training, most of the responses on the last j 
training trial and on the transfer trial should ' 
be correct responses or phonological errors. 

A strictly all-or-none concept model would 
postulate that if a subject had learned the 
phonological concept, only phonological re- 
sponses would be produced at the end of the 
training and during transfer; otherwise such 
responses would occur at a chance rate. This • 
model is certainly too strict; there would be 
some correct responses due solely to rote 
learning and some errors due to attentional : 
lapses even though a child had grasped the 
concept. The distributions in Table 6, show- 
ing the number of children making various 
numbers of correct or phonologically related 
responses on the last trial and during transfer, 
are significantly bimodal when compared, for 
example, with a binomial distribution. Some 
children achieved perfect or near perfect scores, 
and most others made no correct responses, or 
at most one. Few subjects performed at an 
intermediate level. 

Some children (3 in Msn , 4 in Bel ) persisted 
in simply repeating the stimulus word. The 
study-test procedure which has proven an ef- 
ficient means of teaching paired-associate 
lists to college students (Battig, 1965) was 
inadequate in interrupting stimulus repetition 
errors, perhaps because it did not provide 
immediate negative feedback to the children. 

One child (in the Bel sample) responded to each 
stimulus word with another word (e.g., BOOK , 
HORSE . CAT ) as though performing a word- 



association task. Except for the eight subjects 
just described, there were very few "real word" 
responses which were not phonologically re- 
lated to the stimulus word. 

In summary, although on the Rhyming , 
Rhyming -Retest , and Initial-Sounds children 
were unable to deal with phonetic segments 
by identifying words as same or different on 
the basis of such segments, on Rhyme-Produc- 
tion and Segmentation children produced words 
or word-like units based on phonological seg- 
mentation of stimuli. Moreover, there was a 
definite relation between a child's performance 
on the Segmentation and Rhyme-Production 
tests, as shown in the contingency table in 
Table 7. Also shown in Table 7 are the con- 
tingency tables for Segmentation and Alphabet- 
Production , and Segmentation and Alphabet- 
Recognition . This result suggests that these 
two tests are convergent; that both are tap- 
ping the same set of acoustic-phonetic skills. 
The relation between segmentation and the 
alphabet knpwledge tests, weaker. than the 
previous relation but statistically significant, 
could be taken as evidence that acoustic- 
phonetic skills are important in later reading 
achievement, or as a measure of a general 
performance factor in all the tests, or both. 

The possibility that we may have begun to 
isolate acoustic- phonetic skills in prereaders 
is encouraging. There remains the problem 
of relating this skill to other tasks requiring 
analysis of word sounds, particularly the iso- 
lation and identification of phonemes, and of 



Table 6 

Distribution of Subjects on Segmentation Test; 
Number of Subjects Making 0, 1, 2, or 3 Correct 
or Phonological Responses on Final Training Trial, 
and 0, 1, ... 6 Such Responses on Transfer Trial 
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Table 7 



.1 

Contingency Tables Showing Relation of 
Performance on Segmentation Test with 
Rhyme-Production, Alphabet-Production, 
and Alphabet-Recognition Tests 

Ciiildren in each Sample Partitioned into Those Above and Below Median 



Segmenta- 

tion 


Rhyme- 

Production 


Alphabet- 

Production 


Alphabet- 

Recognition 


Above 


Below 


Above 


Below 


Above 


Below 


Above 


17 


5 


15 


6 


14 


7 


Below 


3 


18 


7 


15 


7 


15 




= 17.1 




6.7 


x^ = 


5.9 




£< .001 


£< .01 


£< ,05 



devising and evaluating various training pro- 
cedures which could be used with children who 
are deficient in this area . 



DISCUSSION 

There has been relatively little research on 
acoustic-phonetic skills in children. Even the 
simplest problems remain to be fully investi- 
gated, such as the types of confusions that 
occur in phoneme perception. Some work has 
been done on children's discrimination of 
minimal pairs using a same -different procedure 
(e.g., "does Aa/ sound the same as /pa/?"; 
Deutsch, 1967; Templin, 1957). Unfortunately 
this procedure is difficult for young children, 
particularly those who are most lacking in gen- 
eral language ability. Studies recently com- 
pleted in our laboratory (Kamil & Rudegeair, 
1969; Skeel, Calfee, & Venezky, 1969) show 
that under optimal conditions , ikindergartners 
and even younger children could discriminate 
between minimal phonetic contrasts, even when 
meaningless CVC's are used as stimuli. One 
factor in obtaining satisfactory performance 
levels by pre-readers on this task was avoid- 
ance of the same -different test procedure . 
Another factor was the use of repeated test ses- 
sions; in the study by Kamil and Rudegeair, 
the* error rate on the second day of testing was 
about half that on the first day. Half of the 
errors which did occur involved phonetic con- 



trasts known to be difficult for adults — /f - 0/ 
and /v - bA 

An early study by Bond (1935) is perhaps, the 
most complete and informative study of auditory 
factors related to reading available. His sub- 
jects were; second and third graders who were 
reading at grade level (Control) or were below 
grade level 6 months or more (Experimental) . 

Half the children had been taught reading by 
a phonetic method; half, by a look-say method. 
Six different auditory tests were administered, 
as well as standardized reading achievement 
tests. The auditory tests were ^ (auditory 
acuity); {sound blending; e.g., tester says 
/be,- ai/, student says /bai/); ^ (auditory 
perception, a potpourri; (a) giving the letter 
name for different sounds as in /bl/ for /bo/i 
(b) giving words beginning with a sound such 
as /kait/ for Aa//i and (c) giving words end- 
ing with a "sound" such as Aip/ for /-ip/); 

AM (auditory memory measured by digit span 
and nonsense-syllable span); M (auditory 
rhyme, reproduction of various tapping patterns), 
and ^ (auditory discrimination, a same -different 
test of minimal pairs such as /res - rez/) . 
Finally, an articulation test was given in which 
responses were elicited by line drawings of 
familiar objects . 

Bond's analyses showed that on ^and ^ 
the Experimental Group was significantly poorer 
than the Control Group in both the Phonetic and 
Look-Say conditions. On Test ^the Experi- 
mental Group was poorer than the Control in the 
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Phonetic condition only, while on IM the Ex- 
perimental Group was poorer than Control in 
the Look-Say condition only. It is tempting 
to conclude that to benefit from phonics read- 
ing instruction it is important to have good 
hearing; whereas in whole-word instruction it 
helps to have a good memory, and 

articulation ability were not significantly re- 
lated to reading performance , m is akin to 
what has been investigated as "auditory-visual 
integration," the ability to match auditory- 
temporal patterns (e.g,, a series of dots and 
dashes as in Morse Code) with a corresponding 
visual-spatial pattern. Performance on this 
kind of test has been shown by several in- 
vestigators to be correlated with reading 
achievement. The most complete study of 
this problem (Muehl,& Kremenak, 1966) showed 
that cross-modal matching (auditory-to -visual 
and visual-to-auditory) was correlated with 
first-grade reading achievement (Pearson j[^'s 
of ,52 and ,39) but intramodal matching (audi- 
tory -to -auditory and visual-to-visual) was not. 
Bond's Mtest wns essentially auditory-to- 
auditory and so both sets of data are consistent 
in saying that intramodal pattern matching is 
unrelated to reading achievement, 

Muehl and Kremenak also combined the four 
auditory-visual-integration scores with various 
reading readiness scores in a discriminant 
function analysis aiid found that the letter- 
naming readiness score was the only signifi- 
cant predictor of first grade reading achieve- 
ment, The predictive value of the child’s 
knowledge of the alphabet in kindergarten is 
well known. Equally apparent is the diagnostic 
uselessness of this information. Cross-modal 
pattern matching ability may be a poorer pre- 
dictor, but would seem to have greater poten- 
tial for diagnosis of a specific deficiency. 

Two other recent studies have examined 
acoustic-phonetic segmentation skills in young 
children. Bruce (1964) had children between 
the mental ages of 5 and 9 say what would be 
left if a particular phoneme was deleted. The 
phoneme was either initial, medial, or final 
(e.g., BRING, WENT, OR EVERY, the under- 
lined letter corresponding to the phoneme to 
be deleted) . Relatively few children were 
tested at the younger ages and the stimulus 
words were quite varied. His pretraining pro- 
cedures were well designed to guarantee high 
levels of performance. Nonetheless, children 
with a mental age of 7 or less were completely 
unable to perform the task. By age 8, more 
than half of the responses were correct, Bruce's 
analysis of errors provides an interesting pic- 
ture of the development of segmentation ability 
during the early school years. The details are 
beyond the scope of this paper but, briefly, from . 



age. 6 on, disregarding omissions and stimulus 
repetitions, more than 90% of the errors are 
Phonologically related to the stimulus word, 
McNeil and Stone (1965) first train'-ed kin- 
dergartners to hear /s/ and /m/; one group 
was trained on 24 real words, a second on 24 
nonsense words. The criterion test consisted 
of four real and four nonsense words, for each 
of which a child was asked, "Do you hear /s/ 

or /m/ in ?" Following training on 

the real words, 58% of the criterion responses 
were correct compared with a guessing rate of 
50%; following nonsense-word training, 78 % 
of the criterion responses were correct. Real 
words were used exclusively in the same- 
different tests in the BST package; McNeil 
and Stone's results suggest that by using 
semantically anomalous stimuli, children 
might more easily focus on the phonological 
properties of the words , 

Identification of significant acoustic- 
phonetic skills is a significant first step, 
but this must be followed by the development 
and evaluation of remedial training programs 
to realize the full potential of this approach. 
The work of the Russian psychologist Elkonin 
(1963) is the most interesting effort at train- 
ing of phonetic or word-analysis skills of 
which we know. The details of his procedure 
are a bit obscure but in general this training 
procedure taught children to match phonemes 
with blank tokens which they arranged on a 
formboard with as many spaces as the word' 
had sounds, A picture representing the word 
was also present. Gradually, specific identi- 
fication of sounds was introduced. and the 
formboard removed; the child had to select 
the appropriate number of tokens and, after 
repeating the word, break it up into individual 
sounds. In spite of the scant details, Elkonin's 
technique looks promising and the results are 
encouraging. 

The investigations above suggest that the 
problems of kindergartners /in acoustic- 
phonetic analysis are. not sensori-perceptual. 

— children can discriminate minimal phonetic 
contrasts about as well as adults and can per- 
form some tasks which require phonetic seg- ' 
mentation. Instead, the problems are cogni- 
tive, as Vernon (1960) concluded: ", , , The 
children (backward readers) were familiar with 
the general shapes of words and the letters 
they contained , but their knowledge was quite 
unsystematic. They knew something about the 
'shapes of letters and letter groups and their 
associated sounds and that the letters had to 
be blended together to form the words but they 
were so confused that they had no certainty as 
to the correct manner of performing these pro- 
cesses, or of coordinating them together. 
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’’The most common feature of reading dis- 
ability is the incapacity to perform the cogni- 
tive processes of analyzing accurately the 
visual and auditory structures of words . The 
backward reader guesses wrong letters, or the 
right letters in the wrong order .... 

’’The fundamental and basic characteristic 
of reading disability appears to be cognitive 
confusion and lack of system. Why even quite 



an intelligent child should fail to realize that 
there is a complete and invariable correspon- 
dence between printed letter shapes and pho- 
netic units remains a mystery which has hot 
yet been solved. It must be attributed to a 
failure in analyzing, abstraction, and gen- 
eralization, but one which, typically, is con- 
fined to linguistics . . . (p. 71)." 
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VI 

LETTER-SOUND ASSOCIATION 



This section focuses on tasks which re- 
/ quired'the children to- learn to associate sounds 
and letter-like symbols. These tasks were 
analogous in many respects to the early stages 
of reading instruction. The materials were 
relatively abstract and unfamiliar. Various 
taisk formats were investigated; in some tests 
a spoken. response was given to a visual sym- 
bol, and in others the experimenter pronounced 
a stimulus "word," and the child had to pick 
out the corresponding visual symbol. Most of 
these- tasks were quite difficult and learning 
was slow. Nonetheles's, detailed analysis of 
the data provided some clues to specific 
sources of difficulty in these tasks. 



METHOD 

Alphabet- Learning was analogous to mem- 
orizing the names of the letters of the alpha- 
bet. Six nonsense forms (after Gibson, et al . , 
1962) served as stimuli, three paired with a 
consonant-vowel response (/we/, /ge/, /je/) 
and three with a vowel-consonant response 
(/ae't/, /^as'd/, y/® b/) . These stimuli were 
chosen to parallel the pattern in the English 
alphabet represented by such sets as B, D, G 
and L, N, S. Except for /as b/, all of the 
responses were real words. 

In this and the other learning tasks, study 
and test trials alternated. In this particular 
task, on the first half of each trial, three 
items were studied and then tested, and on the 
second half of the trial the remaining three 
items were studied and then tested. The list 
was always split on each trial to produce a 
mixed response set; i.e., the /-e/ and /ae,-/ 
subsets were never presented all together. 

Five trials were administered in this manner. 

The Letter-Word task investigated the blend- 
ing of letter-sound combinations . The Letter 
list consisted of three unfamiliar letter forms 
(capital Greek) , denoted M, N, and O, paired 




with the sounds /m/, /n/, and /o/. The Word 
list was derived from the Letter list by pairing 
O with M or N to form MO , NQ i QM , ON and 
associated responses /mo/, /no/, /om/, and 
/on/. / Five study-test sequences were given 
on each list. Half the children learned the 
Letter list first and then the Word list, the 
other half were given the lists in the reverse 
order. In the instructions it v/as pointed out 
that the words consisted of two symbols but 
the individual letter-sound correspondences 
were not stressed. 

The Context-Learning task was directed 
primarily toward the effect of variability in 
letter-sound correspondences on learning these 
correspondences. The test used the capital 
letters D, C, A, and E as stimuli. First, the 
child was shown A and E and taught to point 
to A when the tester said /e/, and to E when 
/i/ was' pronounced. Then the stimulus 
pairs DA and DE were placed before the child 
and he was told to point to DA when /de/ was 
pronounced and to DE when /di/ was pronounced,. 
Then a test was administered; the tester said 
/di/ and /de/, and the child had to point to 
either DE or DA. This type of multiple-choice 
recognition test procedure was used throughout 
this task.- Next, the pairs CA - A©/ and CE 
- /si/ were presented for a study-test sequence. 
[Thus, all of the letter-sound correspondences 
were invariant except for C which was A/ 
before A and /s/ before E . ] There followed a 
series of five study-test trials on the entire 
set of four pairs, unless a criterion of two 
perfect test trials in succession was attained. 

If the child made an incorrect selection, the 

tester said "no, that makes the sound . 

Let's try again." Finally a transfer test was 
given; the set DAZ , DEZ, CAZ . and CEZ was 
presented, and the child was asked to point 
out in turn "which one says ..." /dez/, 

/diz/, /kez/, and /siz/. 

Only the Msn sample received these 
tests . 
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RESULTS 

Alphabet-Learning 

Over the five-trial series, the percentage of 
correct responses to the six-pair list increased 
from 22% to 33%; the percentage of omissions 
decreased from 38% to 16%; correct responses 
as a percentage of responses attempted re- 
mained constant at around 38%. In 28% of the 
attempts, the vowel was appropriate to the 
stimulus shown. Another 17% of the attempts 
were intrusion errors from the nonmatching sub- 
list (e.g., / as d/ was given as a response in 
place of /we/). The remaining errors (17%) 
could not be classified. 

From the figures given above, it might appear 
that the learning was going on in a reasonable 
manner. Response omissions were decreasing; 
more than a third of the attempts were correct; 
and of those attempts which were errors, al- 
most three-quarters gave evidence of response 
learning. In other respects the data are dis- 
couraging, however. First, all of the improve- 
ment took place from the first to the second 
trial, where omissions dropped from 38% to 19%, 
From the second through the fifth trials, there 
was no noteworthy change in any of the meas- 
ures, Second, not one subject out of twenty- 
one reached criterion; in fact, only five chil- 
dren made four or more correct responses on 
the final test of the six-pair list. Third, there 
was no consistency in the retention of pairs 
from one trial to the next. For example, of 
those pairs which were correct on the fourth 
trial, only 43% were correct on the fifth trial. 
Comparing this percentage with the uncondi- 
tional percentage of correct responses on the 
fifth trial, 33%, it can be seen that whether or 
not an item was correct on the fifth trial was 
only slightly dependent on the response of the 
previous trial. This indicates that associative 
learning was not occurring; the children were 
learning the appropriate response terms, but 
were unable to connect these terms with spe- 
cific stimuli. 



Context- Learning 



The results were similar to those of the 
previous test. The percentage of correct re- 
sponses rose from 39% on the first trial to 64% 
on the second trial and then remained constant 
over the remaining trials . Of those items cor- 
rect on the fourth trial, 68% were correct on the 
fifth trial, compared with the unconditional 
percentage correct on the fifth trial, 62%, 

Again, whether or not an item was correct on 
the fourth trial was not predictive of performance 



on the fifth trial. This was a recognition test 
and so the errors can again be attributed to a 
breakdown in association. 

The training series on the consonant- vowel 
pairs was preceded by pretraining on the 
vowels , but there was little evidence that the 
children used this information, A response 
error could be classified as a vowel error 
(DE chosen for /de/), a consonant error (CA 
chosen for /de/), or both (CE chosen for 
/de/) , If the vowel component had been 
learned, vowel errors should have been rare; 
a similar argument can be made with respect 
to consonants. Actually, the even distribu- 
tion of errors indicates the children treated 
the items as wholes (32% vowel errors, 34% 
consonant errors, and 34% both), A similar 
pattern was found in the transfer data , The 
percentage of correct answers was 29% (com- 
pared with a chance rate of 25%) , and the 
errors were 39% vowel, 36% consonant, and 
25% both. The difference during training be- 
tween variant and invariant items (D vs , C) 
was negligible: 5 6% vs, 54% correct, re- 
spectively, This finding is also consistent 
with the thesis that the stimuli were pro- 
cessed as wholes , 

In short, there was little evidence of 
associative learning eiLier of patterns or 
components after the first trial nor was there 
any substantial amount of transfer. On the 
other hand, some children performed quite 
well on the training series: 7 out of 21 
reached a criterion of two perfect trials. 
Moreover, there was some evidence of dif- 
ferential transfer; those children reaching 
criterion during training were correct 43% in 
transfer; those who failed to. reach criterion 
were correct only 20% in transfer. 



Letter-Word 

Only in this test was there evidence of 
substantial learning over trials. The results 
are shown in Fig , 2, Performance on the 
Letter and Word lists was essentially the 
same for both the Letter- Word and Word-Letter 
orders and so the data have been combined 
over orders. As can be seen from the figure, 
in the Letter list there was a steady increase 
over trials in the percentage of correct re- 
sponses, primarily due to a reduction in 
omissions. Of those pairs which were cor- 
rect on the fourth trial, 88% were also cor- 
rect on the fifth trial, compared with an 
overall percentage correct on the fifth trial 
of 66%, In other words, in the Letter list 
subjects retained previously "learned" items 
from one trial to the next and added to this 
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Fig. 2 Letter-Word Test Responses to 

Letter and Word Lists . Over Trials 



store of learned items on each trial. The pat- 
tern of results for the Word list, on the other 
hand, was similar to that previously described 
for the Alphabet and Context tests . None of 
the performance measures changed to any 
noticeable degree over the five trials . 

Summary 

The kindergartners tested found it extremely 
difficult to form associations betv/een the un- 
familiar abstract stimulus -response pairs used 
in this portion of the BST package. The only 
exception was the Letter list of the Letter- 
Word test. The major distinctions between 
this latter list and the others were length 
(three /pairs) and the elemental nature of the 
stimuli and responses (/m/, /n/, and /o/) . 
The effect of list length on paired-associate 
learning is well known but in the BST package 



the variation in length was relatively slight 
(three pairs in the Letter list to six pairs in 
the Alphabet list) . Except for the Letter list, 
the spoken segment comprised two phonemes 
in these tasks. Moreover, there was repeti- 
tion of phonetic elements (e.g., /mo/, /no/, 
/om/, /on/) producing high intralist similarity, 
which after the fact we feel may be a critical 
variable in these particular tests. 

In any event, if a child does not "know his 
ABC's," if he has not learned to identify the 
letters of the alphabet with their corresponding 
"names," it is evident that he faces immedi- 
ately a paired-associate task of some magni- 
tude in beginning reading. Note that the 
names of the letters of the alphabet are, by 
and large, difficult in the same sense that ouv 
materials werej they are combinations of two 
(or more) phonemes used in repetitive combina- 
tions producing high intralist similarity. 
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DISCUSSION 

Distinctive Feature vs. 

Pattern Learning 

Developmental studies of verbal learning 
are relatively scarce, given the volume of re- 
search in this area on college students. As 
Flavell and Hill (1969) point out in a recent 
review, child-learning studies have focused 
on a small number of specific problems — dis- 
crimination, transposition, probability learn- 
ing, and paired-associate learning — and in 
none of these areas have the studies revealed 
anything of much interest relative to develop- 
ment per se . In fact, Flavell and Hill direct 
their most critical comments toward this lacuna. 
They call for research which will in "more 
direct and sensitive ways assess . . . the 
child's knowledge of classes and relations, 
his attentional biases, his perceptual skills, 
and his mnemonic strategies . . . and capaci- 
ties . . . (p. 44)." 

In a particularly. revealing experiment, Ann 
Pick (1965) taught kindergartners to select 
standard visual symbols from a set of alterna- 
tives which included transformations of the 
standard. For example, if b was the standard, 
then the set b, d . would involve a right-left 
transformation. [The letters b and d are for 
illustration only; Gibson's forms were used as 
stimuli.] After a training criterion was reached, 
a transfer test was administered-. In the Con- 
trol group, the transfer task included new 
standards and new transformations (e.g., 
select w from w, m, ), In the Standard 
group, the same standard set was used but 
with new transforms (e.g. , select ^from 
b , £ ) . Finally, in the Transform group, 
there was a different standard set, but the 
transformation remained the same (e.g., se- 
lect £ from £, £ ) . 

It was reasoned that if children had learned 
to choose the standard by reference to an exact 
template or image in memory, then the Standard 
group should perform best on the transfer test, 
since the same templates could be used. If 
selection was based on transformation of spe- 
cific features , then the Transform group should 
do best in transfer. If specific intor-item com- 
parisons were learned, then all three groups 
should perform equally well on transfer. Pick 
found that during transfer the Transform group 
made 40% as many errors as the Control group 
and the Standard group made 70% as many 
errors.. These results indicate that young 
children can abstract transformations of dis- 
tinctive features and use these apart from 
the specific context in which they were orig- 
inally embedded. 



Transfer of Letter-Sound 
Learning 

There remains the intriguing question of v^hy 
such transfer occurs under some conditions and 
not others. In the Letter-Word test in the BST 
package, there was no evidence bl‘ transfer 
based on combination of elements. Silberman 
(1964) has recounted his sad experiences in 
trying to teach kindergartners' "reading" ma- 
trices with the structure below. 



VOWEL-CONSONANT ENDINGS 





-an 


-it 


-at 


-in 


f- 


Test 


Train 


Train 


Train 


r- 


Train 


Test 


Train 


Train 


s- 


Train 


Train 


Test 


Train 


m- 


Train 


Train 


Train 


Test 



In the experimental procedure, the child learned 
the Train CVC's (e,g. ,_f + at = /f as t /) and 
then were given a transfer test on the diagonal 
test items (e.g. , _f + an = /f s n /) . An ini- 
tial version of the procedure yielded no transfer 
at all. In the version which finally evolved, 
the children's attention was directed to the 
component features (C- and -VC) and then they 
were taught to amalgamate the. elements . A 
final training segment provided direct practice 
on transfer to new combinations . After this 
highly structured program, children performed 
reasonably well on transfer items (75% correct). 

It is apparently unwise to assume that trans- 
fer will occur "naturally" or that a process of 
induction or stimulus generalization guarantees 
that children will apply information gained in 
one situation to other similar problems. This 
may happen^but it may not. In particular, 
little transfer occurs when the stimulus-respiDnse 
pairs consist of acoustic- phonetic materials. 

This conclusion is consistent with the earlier, 
discussion of the integrated character of the 
spoken language. 

Other Studies of Letter- 
Sound Association 

A study by Jeffrey and Samuels (1967) bears 
on "learning the ABC's" and transfer of elements 
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in paired-associate learning involving spoken 
materials • Jeffrey and Samuels asked whether 
training on elements or patterns was the more 
efficient way of teaching reading; is more 
transfer achieved by teaching words (MO - 
/mo/, SO - /so/, BA - 

letters (M^ "• /m/, S_- /s/, A - /e/ , E_ - /i/) 
where the transfer set consists of words (ME 
- /mi/, SE - /si/, SA - /se/, - /me/)? 

In the former procedure, the child is trained on 
integrated word-like units and must induce the 
letter-sound regularities, whereas in the latter 
procedure the regularities are taught but the 
child has less experience in working with inte- 
grated units. The kindergarten subjects were 
first trained to scan letter groups from left to 
right and to "blend phonically" (e.g,, (/m/ - 
/i/) equals /mi/). Different groups of sub- 
jects were given Word training; Letter training, 
or were trained on an unrelated Control list of 
cartoon animals paired with proper names , The 
"letters" used as stimuli were fairly complex 
visual symbols . 

Two results from this study are of interest. 
First, in reaching a criterion of one perfect 
trial in training, the Letter group required 13,8 
trials; the Word group, 16.9 trials. Compared 
with the differences in performance over five 
trials in the Letter and Word lists in BST, this 
is a small difference. The Jeffrey-Samuels 
materials were of lower intralist similarity and 
this may account for the differing results. 



Second, the number of trials to criterion on the 
transfer task was 13.5 in the Letter group, 

27. 2 in the Word group, and 29,3 in the Con- 
trol group. Thus, training on letter elements 
substantially facilitated transfer to words in 
the Jeffrey-Samuels study, although it did not 
in the Letter-Word task of BST. Several dif- 
ferences in procedure between the two studies 
might b© responsible for the different results. 

First, in the BST Letter-Word task, original 
learning was not carried out to a criterion as 
in the Jeffrey-Samuels study. This is probably 
not important because subjects who did reach 
criterion on the Letter-Word test showed' no 
more transfer than noncriterion subjects. 

Second, in the Jeffrey-Samuels study the child 
was instructed at the beginning of the transfer 
task, "You can tell what the wc>rd is by making 
each letter sound. Make the first letter sound, 
then the next letter sound. Then try to say the 
word," It would appear than for kinder gartners 
to learn to "read words" in miniature reading 
tests (a) the child should first be taught indi- 
vidual letter-sound correspondences, (b) it 
must be made explicit that words are to be read 
as sound combinations, and (c) instruction in 
phonic blending must be given. These steps 
are necessary because young children do not 
tend naturally to break up reading patterns into 
elements, nor do they naturally induce corres- 
pondences in the process of learning specific 
examples representing particular correspondences. 
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VII 

VOCABULARY 



In one guise or another, a child's knowl- 
edge of vocabulary is an important predictor of 
reading achievement. Vocabulary tests often 
require children to identify uncommon words, 
or to try to label indistinct or ambiguous line 
drawings. Performance on such tests reflects 
the level of general intellectual ability as well 
as vocabulary skills . 

Three different tests of vocabulary knowl- 
edge were included in the BST package. Each 
child was asked to name common objects dis- 
played in full-color pictures, label line draw- 
ings, and to sort the line drawings into the 
most appropriate categorial arrangements. 

None of the objects was likely to be completely 
novel to any child, but the names of some were 
far less common than the names of others. 



METHOD 

In Picture-Naming , the child was asked to 
name each of 22 common objects. The objects 
were photographed in color and the transpar- 
encies were seen in a hand-held viewer In 
Li ne -Drawing-Naming . the child was asked to 
label objects shown as line drawings printed 
on 3 X 5 cards. The sketches incluucrd some 
detail and shading. There were five objects in 
each of six categories and six objects in four 
categories. The categories were animal s . fcod, 
toys , vehicles . articles of clothing , furniture , 
dinnerware, parts of the body , insects , and 
round things . Within these categories, items 
were selected on the basis of distinctiveness 
and familiarity. * 

The line drawing cards were also used in 
the Sorting test, which consisted of three parts; 
(a) two categories without exemplars, (b) four 
categories with exemplars, and (c) four cate- 
gories without exemplars . In each of these 
subtests, the child was given a deck of cards 
containing five cards from each category and 
asked to sort them into groups that "belong 



together," A formboard was used to. force the 
children to sort into the desired number of 
categories. In (b), four exemplar cards were 
placed at the left of each row of the formboard, 
and the child wajs instructed to sort the deck 
into four groups using the exemplars as a basis. 



RESULTS 

The error rate on Picture-Naming was low: 
4,1% (Msn ) and 3.7% (B^), Of the total of 
39 errors, 17 were intraclass errors of one sort 
or another (e.g. , rose for flowers . pie for cake) 
6 were omissions, and the remaining 16 were 
"other" (e.g., neckt ie for hat, bread for apple . 
cookies for pennies ) . ^ The errors were widely 
distributed over subjects — there were 17, 16, 

7, and 3 children who made 0, 1, 2, or 3 errors 
respectively. 

The error rate on Line-Drawing-Naming was 
higher than on Picture -Naming , largely because 
the test contained more unfamiliar items . On 
the eight objects common to the two tests 
(apple , cake , chair , car , cat, dog, horse , 
shoe ) there were actually more errors on Pic- 
ture^Naming than on Line-Drawing-Naming 
(1 2 versus 4). Overall, the error rate was 
15.7% (Msn ) and 20 . 6% (Bel ) on Line-Drawing- 
Naming . Although the rate was slightly higher 
in Bel than Msn, the pattern of errors was the 



. The scoring scheme used to classify these 
responses is admittedly arbitrary in some in- 
stances . In general, our classification was 
conservative; by a more liberal criterion, only 
two or three of the responses were not related 
semantically or perceptually to the stimulus. 
For example, one might argue that necktie for 
hat and cookies for pennies bear some relation 
to the stimulus, even though we classed them 
as "other." 
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same in both sampies, and so analysis will be 
based on the combined data. 

Failures to respond comprised 15% of the 
errors. One quarter of the errors were accounted 
for by five popular substitutions; more than a 
third of the children said wheel for tire , dresser 
or drawer for desk , baby for doll , windmill for 
pin wheel , or crib for bed. A closely related 
error was a coordinate or intraclass confusion, 
such as penny or nickel for half dollar , s pider 
for bee , or goat for cow . (The distinction be- 
tween popular and coordinate confusions was 
based on frequency of intrusion and definitional 
equivalence.) These accounted for an addi- 
tional 34% of the errors. Another 12% were 
errors of superordination or overgeneralization, 
such as money for half dollar , meat for hot dog , 
or face for mouth . The remaining 14% of the 
errors were classified “other." Most of these 
latter errors were the result of overdifferentia- 
tion of the stimulus (e .g . , bird to half dollar ) , 
a few others were incomprehensii;le (e.g., leaf 
for ant, “ boodle alien “ for deer ) . 

Seven words (half dollar , mouth , bee , ant . 
grasshopper , spider , and pitcher ) produced a 
large proportion of errors marked by consider- 
able variation (e.g., pitcher was variously 
described as coffee pot , coffee cup , cup , can, 
kettle . bottle . cream bowl , etc., ant as grass- 
hopper , spider , bug , fly , flea , leaf, dirt , etc.). 
More than a third of the children gave deviant 
responses to these cards. On 28 of the remain- 
ing 38 stimuli, four children or less made re- 
sponses classed as errors. Thus, the distri- 
bution of errors over the stimuli tended to be 
bimodal: stimuli produced many or few errors . 
The distribution of errors over subjects tended 
to be unimodal, contrary to the bimodal subject 
distributions found in-other tests. In Line- 
Drawinq-Namlnq . errors ranged from 1 to 15, 
with 60% of the error scores between 5 and 10. 
The split- half reliabilities were .53 (Msn ) 
and . 80 (Bel ) . 



Sorting 

This test was given to the Msn sample only. 
One performance measure was the number of 
times the most frequent category in each row 
was represented. This measure disregarded 
the cue cards in the four- cate gory-with- ex- 
emplar subtest. The guessing rate was 50% 
for the two-category subtest and 40% for the 
four-category subtests . The observed per- 
centages of correct responses were 70%, 64%, 
and 63% for the two-category, four-category- 
with-exemplar , and four-category-without- 
exemplars subtests, respectively. The per- 
centage correct on the second test dropped to 



56% when correct responses were defined as 
those sortings which matched the exemplar. 

There were substantial individual differ- 
ences on the sorting task, showing up as a 
markedly bimodal distribution. About half of 
the children (10 out of 21) made scores be- 
tween 37 and 48 correct out of 50 possible, 
which is less than one and a half mistakes per 
category. The other 11 children made scores 
of 27 to 22 correct, or about two and a half 
mistakes per category. 

The distribution of sorting errors over the 
stimulus categories, on the other hand, was 
uniform within each of the two four-category 
sets used in the test, although there was a 
sizable difference between the sets . The rele- 
vant data are presented. in Table 8. The fre- 
quencies along the main diagonals are correct 
sorts, the number of times that two items be- 
longing to the same normative category were 
placed in the same stack -by a child. The off- 
diagonal frequencies are category confusion 
errors; thus, in set 1 there were 63 occasions 
in which an item from the kitchen category was 
placed in the same stack as an item from the 
clothes category. Also shown in the table are 
the number of labeling errors for items in each . 
category by children in the Msn and Bel ex- 
amples . 

Several aspects of the data should be noted. 
First, within each set there was little evidence 
of differences in the number of correct sorts for 
any category. There was a difference between 
sets, which could have resulted from specific 
category difficulty, or differential intracategory 
confus ability or both. For example, the toy 
category had the lowest number of correct sorts 
and was also more frequently confused with 
other categories, particularly body parts and 
furniture . This result could reflect a poorly 
developed concept of toy, or the existence of 
specific confusions between toy-body parts 
and toy-furniture ; the data from this study do 
not permit us to decide between these two in- 
terpretations . Second, the distribution of con- 
fusion errors is reasonably uniform. As just 
noted, toy-body parts and toy-furniture con- 
fusions are greater in number than other com- 
binations, but even these maximum deviations 
from a uniform distribution of confusion errors 
arp relatively slight. Third, there is no dis- 
cernible relation between sorting performance 
and labeling performance. In Set ' , there were 
21 labeling errors (disregarding “populars") for 
kitchen items and 2 labeling errors in the ve- 
hicle category; the number of correct sorts in 
these two categories was virtually identical. 
Similarly, in Set 2 there was a large difference 
in labeling errors for the toys and furniture cat- 
egories, yet only a slight difference in sorting 
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Table 8 



Correct Sorts and Intercategory Sorting Confusions 
in Four-Category Sets 



Set 1 



Kitchen 


Clothes 


Animals 


Vehicles 


Categories 


(21, 28) 


(9, 12) 


(8, 18) 


(2, 6) 


112 


63 


70 


63 


Kitchen 




ill 


70 


69 


Clothes 






106 


64 


Animals 








117 


Vehicles 


Toys 


•64 








Food 


79 


84 






Body 

Parts 


101 


77 


93 




Furniture 


111 


90 


59 


78 


Categories 


Toys 
(15, 33) 


Food 
(5, 4) 


Body Parts 
(8, 20) 


Furniture 

(1,4) 



Set 2 



NOTE — Numbers in parentheses are labeling errors in Msn and samples. 



errors. In short, the coherence of a category 
was not related to the availability of labels 
for items in the category for the reasonably 
familiar and concrete items used in this test. 
The pattern of labeling errors was similar In 
the Msn and samples; the Spearman rank- 
order correlation between labeling errors per 
category for the two samples was .85'. 



DISCUSSION 
Vocabulary Size 

Under the conditions used in the BST package, 
the children tested generally gave appropriate 
names for the common objects in the test. 
When errors occurred, only rarely were these 
omissions; most commonly, children produced 
words that were "close'* to the normative label. 
To be sure, the test words were concrete ob- 
jects presented in isolation. The effects of 
variation in material (abstract vs,, concrete. 



verbs and adjective forms ys,. nouns)' and test 
procedure remain to be fully explored. 

Prior to the advent of transformational gram- 
mars , estimation of vocabulary size was one 
of the most popular topics for psycholinguistic 
investigation (Irwin, 1960; Templin, 1957; 
McCarthy, 1954), Performance on various 
types of vocabulary tests were shown to pre- 
dict intelligence and general school achieve- . 
ment, including reading acquisition. This 
last relation might be considered unusual in 
light of the severe limitations on vocabulary 
size and content in most reading series. Sev- 
eral studies have shown vocabulary size in 
kindergartners 'to be substantially greater 
than the number of different words in most 
reading series. 

It seemed to us that a vocabulary test which 
used sufficiently direct means of eliciting the 
names of common objects would not distinguish 
between good and poor readers. Our assump- 
tion was that virtually all children would know 
the .most common words in the language, 
though they might vary in the extent to which 
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they were familiar with less common words.^ 
There is some evidence from BST intertest cor- 
relations to be discussed later that this assump- 
tion is incorrect, that even in identification of 
common words the quality or character of a 
child's label may be related to other cognitive 
skills which we have assumed to be important 
in beginning reading. 

Vocabulary Quality 

A variety of techniques have been used to 
measure vocabulary: samples of free speech, 
elicitation of words from pictures of line draw- 
ings, recognition tests in which the child hears 
a word and is asked to pick out the most appro- 
priate picture in a set, asking the child to de- 
fine a word pror»ounced by the experimenter, 
free association tests in which the child is 
asked to pronounce as many words as he can 
think of in a fixed period of time, or word 
association tests in which the child is asked 
to say the first word that comes to his mind 
when the tester pronounces a stimulus word. 

The "vocabularies" being measured by these 
various techniques might be expected to be 
quite different. 

Templin (1957) distinguished between use 
and recognition vocabulary. She measured the 
former by the Seashore-Eckerson Vocabulary 
Test in which the child is asked to name a pic- 
ture, and the latter by the Ammons Picture Rec- 
ognition Test in which the child is shown sev- 
eral pictures and asked to point to the picture 
corresponding to a word spoken by the tester. 

A group of 3- to 5-year olds received the 
Ammons test, and a group of 6- to 8-year olds 
the Seashore-Eckerson, Each age group in- 
cluded children from upper and lower socio- 
economic levels. There was no substantial 
difference in performance on the Ammons Test 
between children from upper and lower socio- 
economic levels in the age range from 3 to 5 . 
Hov/ever, 6- to 8 -year olds from a lower socio- 
economic level performed substantially more 
poorly on the Seashore-Eckerson Test than an 
upper socioeconomic group. Although the type 
of test was confounded with age, the data sug- 
gest that the nature of vocabulary test may be 
an important variable in its own right, again 
pointing out the necessity of convergent tests . 



Children for whom English is a second 
language, or those from extremely deprived 
backgrounds, might prove exceptions to this 
generalization. 
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There are many methodological problems in 
testing cl.ildren's vocabulary which have not 
been systematically examined. We know of 
no good techniques for testing anything other 
than concrete nouns, concrete adjectives^ and 
action verbs in young children. The method of 
elicitation and the choice of whether words are 
to be tested in a sentential context or in iso- 
lation probably affect performance. Finally, 
the characteristics of the tester are important, 
especially in testing different social and cul- 
tural groups. For example, there is evidence 
that the speech of Negro children from urban 
ghettos depends on whether or not they are 
tested by interviewers of their own race, who 
may (or may not) speak a dialect closer to the 
child's (Entwisle & Greenberger, 1968; Labov, 
1967). 

Although a large amount of research had 
been completed through 1950 on the develop- 
ment of vocabulary size, techniques of analy- 
sis wer^e so unrefined that little can be said 
about the actual process of development, ex- 
cept that vocabulary size increased with age. 
Most investigators and reviewers have raised, 
the problem of what it means to "know" a word. 

If the developing child simply has an increas- 
ing number of words at his disposal, that is 
one thing, however, it has been frequently 
suggested that not just the number of words 
available, but the way in which a child is able 
to use those words changes with age. McCarthy 
(1954) mentions work by Feifel and Lorge (1950) 
and by Gerstein (1949) on the form of defini- 
tional responses used by children from 6 to 14 
years of age in the Stanford-Binet and Wechsler- 
Bellevue Vocabulary items. "The younger chil- 
dren more often employed the use, description, 
illustration, and demonstration types of defi- 
nition as well as inferior explanation and repe- 
tition responses. Older children significantly 
more often used synonym and explanatory re- 
sponses (p. 530)." Unfortunately, when the 
definitional type of vocabulary test is used, 
one cannot tell whether or not the observed 
behavior is an accurate representation of the 
language system or whether one is primarily 
testing the productive system as opposed to 
the receptive or comprehension system. In 
other words, younger and older children may 
understand words equally well, but older chil- 
dren are simply better able to explain their 
understanding. Communications research by 
Krauss and Glucksberg (1969a, 1969b) supports 
the latter interpretation. 

Russell and Saadeh (19 62) also presented 
some interesting results on vocabulary quality. 
They administered a four-alternative-definition 
recognition test to third, sixth, and ninth 
grade subjects. The four answers provided as 
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alternatives were classed as functional , con- 
crete . abstract , and incorrect . [For example, 
COUNT ; to find the number of things in a group 
(functional) , to find how many pennies are in 
your pocket (concrete), to say numbers in or- 
der (abstract), and to tell numbers one after 
"nother (incorrect) , The validity of the results 
in this kind of experiment depends greatly on 
the quality of the alternatives, of course.] 

The data showed a clear shift from concrete to 
abstract responses between the third and sixth 
graders, and this was taken as evidence of 
improved comprehension. 



Word Associations 

Another lead to vocabulary functioning comes 
from the literature on word associations. A 
number of investigators (Entwisle, 1966; 

Palermo & Jenkins, 1964; Ervin, 1961; Brown 
& Berko, 1960) have found a shift from syn- 
tagmatic to paradigmatic word associations 
from about 5 to 7 years of age. [Paradigmatic 
responses are those in which the form class 
of the response matches that of the stimulus, 
such as black-white or boy-girl; syntagmatic 
responses are those of a different form class 
which could precede o^ follow the stimulus 
word in a sentence, such as black-dog or boy - 
hit.] This shift can be interpreted as a change 
in the semantic structure of the child's lan- 
guage systems, or a change in production 
ability. Data on this point was obtained by 
Ervin (1961) who tested children with both 
word association and two -choice recognition 
procedures. In the latter test, children se- 
lected the more appropriate of two associa- 
tions for a given stimulus. Certain stimuli 
were particularly sensitive to the syntagmatic- 
paradigmatic shift, e.g., ball ; bat or play . 

In kindergarten and first grade children, the 
syntagmatic response (play) was selected 67% 
of the time, while sixth graders chose the par- 
adigmatic response (bat ) 57% cf the time. 

Thus , even when production ability is mini- 
mized, the shift is observed. The correspond- 
ing percentages in the productive word-asso- 
ciation test were 71% and 61%. 

The thoughtful work of Entwisle (1966) on 
word associations in young children is worth 
noting, especially since it spans the age range 
of most interest to us. In her concluding re- 
marks, Entwisle is quite specific as to what 
word associations are not: they are not simple 
stimulus -response linkages; they are not ele- 
ments of serial chains taken from language 
samples; they are not representative of more 
than limited portion of the child's language; 
and they do not directly reflect the semantic 



space of the child. On the positive side, 
Entwisle makes five major points in her sum- 
mary; 

(a) The major shift from syntagmatic to para- 
digmatic aissociations around the first 
grade was confirmed but found to be 
much more complex than previously rea- 
lized, 

(b) The age at which the shift occurred de- 
pended on form class and word frequency. 
For adjectives with antonymic contrasts, 
such as bright-dark , the shift occurs 
earlier than for those without such con- 
trasts, such as thirsty - ? , or for verbs 
which as a class do not provide many 
natural contrasts . 

(c) There are asymmetries in the form-class 
relations; adverbs produce adjectives, 
but not vice versa. 

(d) The syntagmatic responses of children 
differ qualitatively from those of adults; 
e.g., noun-verb sequences are used by 
children, whereas adults are more prone 
to produce adj ective -noun pairs . 

(e) Children give a greater variety of asso- 
ciations than do adults. 

Entwisle, following a line of argument also 
developed by McNeill (1966), feels that these 
results, particularly (a), (b), (c), shed light 
on the development of the semantic system;" 

. . . [they] tend to reveal the formation of 
word classes or concepts and so they forecast 
the individual' s potencial ability to emit dif- 
ferent combinations of words from those he 
has heard" (Entwisle, 1966, p. 7), She re- 
jects the idea that in associating to a stimulus 
word the subject finds the word in a mental 
dictionary and pulls off the topmost entry in 
an associative list or response hierarchy. 
Instead, it is assumed that a word in memory 
consists of a feature list (Katz & Fodor, 1963) 
containing both semantic and syntactic fea- 
tures , Hence, dog might be described as 
animal, domestic, . . . common noun, regu- 
lar plural, ... , With increasing age, this 

system of features becomes more fully devel- 
oped and better organized, so that for an adjec 
tive such as hard, a subject has imn\ediate 
associative access to the antonym soft be- 
cause it differs by a single feature. Prior to 
the full elaboration of this system, there may 
be confusions among similar words , because 
the entries in the feature list are not well 
enough established to permit rapid discrimina- 
tion of close associates. 

An aspect of Entwisle' s data of particular 
interest to us which fits naturally into a 
feature-list theory is the presence of many 
acoustic-phonetic or "clang" associates. 




32 



% 



GPO 6I8~669«^ 



particularly in the responses of younger chil~' 
dren. If the phonetic description of a word is 
stored in memory along with the semantic arid 
syntactic features (see Brown & McNeill, 1966), 
one would expect phonetic associates to occur, 
particularly if other associations are not readily 
available. Such associative rhymes were pro- 
duced frequently by urban Negro children 
(Entwisle & Greenl^erger, 1968), by Entwisle'e 
(1966) kindergartners, by Palermo and Jenkins' 
(1964) sample of fifth grade through college 
students, in Ervin's (1961) Kindergarten to 
sixth grade sample, and in a free-association 
study in which college students were asked to 
produce three-letter words as fast as possible 
(D. Nelson, personal communication; a sam- 
ple might be man , tan , ran, f^, f^, bad , 
sad , etc . ) . In Entwis le* s (1966) study , for 
example, the percentages of rhyming responses 
to add (a high frequency, familiar word) by 
kindergarteners, first, second, and third 
graders were 13%, 7.5%, 3%, and 1%, respec- 
tively. For bitte r (low-frequency), the corres- 
ponding percentages were 20%, 18%, 11%, and 
3%. Ervin obtained similar shifts in the rate of 
phonetic associates from kindergarten to sixth 
grade . As she remarked, these age-related 
trends parallel' the shift from phonological to 
semantic conditioning reported by the Russians 
(Razran, 1939) and by Riess (1946). In the 
data of Palermo and Jenkins (1964), words 
such as the, at, and now , which are frequent 
but have relatively little semantic content, 
evoked phonetic associates occasionally even 



in college students (percentages of rhyming 
associates produced by college sophomores 
were 3%, 7%, and 4%, respectively, for the 
stimulus words given above) . 

Entwisle and Greenberger (1968), testing 
Negro children from the Inner City of Baltimore, 
found that the primary associations by first / 
grade Negro children were frequently rhyming 
responses, such as bad, fad, or mad to the 
stimulus word add , or mean and bean as re- 
sponses to clean . These responses usually 
bore no semantic relationship to the stimulus 
word, and in many instances the responses 
were nonsense, such as flrd to bird . In the 
case of urban Negro children, in 55 of the 96 
stimulus words in the Eritwisle-Greenberger 
list, one or more of the three most common 
associates was a rhyming word. The corres- 
ponding count for urban white children was 18 
out of 96. 

The word association task provides a fruit- 
ful method for examining semantic aspects of 
the language system. The frequency of pho- 
netic associations prior ^o about 6 years of 
age is especially interesting. If it Is true 
that analysis of the acoustic properties of 
spoken language does constitute a major hurdle 
to the learning of the decoding portion of read- 
ing, and if the preschool child is more inclined 
than the k.indergartner or first grader to focus 
on the phonetic composition of words, the teach- 
ing of reading (at least that portion concerned 
with decoding) might be more efficiently intro- 
duced at age four or five . 
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Viil 

INTERTEST CORRELATibNS 



RESULTS 

One of our aims in the BST package was to 
develop clusters of tests which converged on 
specific skill areas — visual, acoustic-phonetic, 
learning, and vocabulary. Since performance 
was poor on most of the acoustic-phonetic and 
learning tests , few intertest correlations in 
those areas will be reported, intertest correla- 
tions for the tests with satisfactory performance 
levels are presented in Table 9, grouped accord- 
ing to skill areas. Alphabet-Production and 
Alphabet-Recognition have been placed with the 
visual matching tasks since the latter used real 
letters as stimuli. Three, other variables of 
general interest — age. Metropolitan Readiness 
percentile, and word memory span — have been 
included in the table. 

The following conclusions summarize the 
matrix of correlations . dusters of tests perti- 
nent to these conclusions are enclosed in block 
form in Table 9 . 



Visual Tasks and 
Alphabet Knowledge 

The visual matching tests in the BST pack- 
age did hang together quite well. Of the seven 
intercorrelations involving Tests 2, 3, and 4, 
six were significant at the .01 level, the ether 
at the .05 level. 

There was some relation between knowledge 
of the alphabet and visual matching. Of the 
twelve intercorrelations in this set, for six 
£ < . 01 , for one £ < . 05 , and the other five 
ranged from . 20 to .36. Oddity-Selection and 
Memory-Matchi ng Vvere least related to alpha- 
bet knowledge. . (Alphabet- Production and Rec- 
ognition were significantly related, it will be 
noted.) Thfe relation of performance within the 
visual matching tests was closer than the rela- 
tion between visual matching and. alphabet 
knowledge, which indicates that the visual 



matching tests tapped a component skill other 
than general competence or age. This con- 
clusion would be on firmer footing if visual 
matching performance with non-alphabet stimuli 
such as Gibson forms were tested. 



Vocabulary 

The predicted relation among the vocabulary 
tests in the BST package was found. The 
absence of a correlation between Picture-Naming 
and Line-Drawing may reflect the low error rate 
on the former test. The fact that labeling and 
sorting were unrelated was noted in the pre- 
ceding discussion on the Sorting test. 

Acoustic-Phonetic Tasks 

Segmentation and Rhyme-Production showed 
inconsistent patterns of significant intertest 
correlation in the two S_ groups when only cor- 
rect segmentation responses on Trial 5 and 
transfer were scored. -When phonolcgically 
related responses were combined with correct 
(as in Tables 6, 7, and 9), the strongest rela- 
tion was between Segmentation and Rhyme- 
Production. 



Other BSt Correlations 

There were 77 other correlations among BST 
tests (age excluded) for which no relation was 
predicted; of these, eight were significant at 
the .01 level, five at the .05 level, and the 
remaining were not statistically significant. 

Of the 29 BST test pairs for which both Msn 
and Bel correlations could be computed, only 
one showed significant correlations for both 
groups . In other words , most of the tests were 
reasonably independent of one another. In par- 
ticular, word-memory span bore no significant 
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Table 9 

Intercorrelations Among Selected Tests from the BST Package 



1 . Age (Mo.) 



2. Matching 
Letter groups 



3 . Matching 
Retest 



4 . Oddity- 
Selection 



5. Me: -ory 
Matching 



6. Alphabet 
Production 



7. Alphabet 
Recognition 



8 . Picture 
Naming 



9 . Line-Drav/ing 
Naming 



1 0 . Sorting 



1 1 . Segmentation 



12. Rhyming 
Production 



13. Word-Memory 
Span 



14. Metropolitan 
(Percentile) 



2 3 4 5 6 7 8 9 10 11 



36 40* * 22 35 68** 27 16 23 52** 24 

-01 -20 — — -08 -01 35 04 — 06 



79** 58** 40* 
72** — — 




50** 58** 
41* 67** 


36 27 01 

34 35 




53** 52** 




50** 63** 

34 75** 


60** 48* 08 

46* 34 — 




63** 




24 36 


33 22 25 



01 

54** 



09 

61** 



25 



20 35 21 29 27 20 



61** 27 35 46* 26 

45* 05 04 — 22 



34 59** 15 15 

36 30 — 5 2** 



18 11 04 

08 — 29 



32 29 



59** 



42* 



12 



28 

14 



17 

13 



13 

33 



-19 



07 



13 



14 



m s 



-16 34 

63** 24 



64 

69 



4 

3 



08 

09 



36^ 63 

67** 68 



11 

12 



12 

18 



51** 49 
68** 42 



25 

32 



20 22 



26 



23 



05 45* 



43 



19 



48* 04 

35 -28 



29 17 

35 03 



13 17 

27 -01 



15 15 

54** 32 



24 08 



54** 

71** 



31 

19 



03 

04 



42* 

21 



27 27 
33 31 



46* 

57** 



23 

18 



31 

27 



10 95 5 

,48* 95 5 



67** 82 7 

■36 78 7 



46* 



61 16 



,’36 66 6 

•58** 59 8 



25 

24 



39 40 
37 43 



24 8 1 

14 8 2 



62 20 
41 27 



NOTE — Upper entry in each cell is Msn sample, lower entry is Bel sample. N = 21 for each 

sample, one S in Bel not included because test 14 was not available. Decimals omitted, 
m is per cent correct except for tests 1, 13 and 14; S is the standard deviation. 

* i > .37, £ < .05, one-tailed test. ** . 50 , £< .01, one-tailed test. 



relation to performance on any of the other 
tests. (Many of the correlations are be- 
tween .20 and .35; it may be a general 
law of behavioral testing that any two var- 



iables will jointly account for 5% to 10% of 
the total variance.) Age did not appear to 
be consistently related to performance on 
.most tests . 
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Metropolitan Tests 

For comparison, the intercon'elations among 
the six subtests and percentile rank of the 
Metropolitan Readiness Tests are shown in 
Table 10. Although each subtest was designed 
to tap an independent skill, 14 of the 30 cor- 
relations were significant at the .01 level and 
5 at the .05 level. The remaining 10 were not 
statistically significant. Of the 19 correla- 
tions of Metropolitan Readiness percentile 
with BST tests, 6 were significant at the .01 
level, 6 at the .05 level, and 7 were not sta- 
tistically reliable. In short, although the sub- 
tests of the Metropolitan were not independent, 
the test served well as a general predictor of 
performance in a variety of cognitive skills . 

DISCUSSION 

Cronbach (1967) has recently raised the 
possibility of a marriage between differential 



and experimental psychology. Several con- 
tributors to the volume (Gagn6, 19 67) contain- 
ing Cronbach' s paper take note that these two 
areas have enjoyed an engagement of long 
duration but there is no evidence that the 
affair has ever been consummated. The failure 
arises from the difficulty of simultaneously 
manipulating stimulus variables, training vari- 
ables, task variables, and subject variables. 
Without pretending that our effort represents 
more than a limited step in this direction, 
some of the results are promising. First, 
without benefit of factor analysis, the BST 
data in Table 8 do hang together in a priori 
clusters when they hang together at all. The 
majority of the other intercorrelat ons are neg- 
ligible in one, the other, or both samples. 
Second, distributions of subject or stimulus 
scores are bimodal in a number of tests, which 
permits partitioning of subjects or stimuli in a 
relatively simple pass -fail fashion. The ques- 
tions then follow: (a) why do some subjects 



Table 10 

Intercorrelations Among Subtests of Metropolitan Readiness Tests 





2 


3 


4 


5 


6 


7 


m 


X 


1 . Word 


60** 


06 


30 


48** 


16 


68** 


63 


17 


Meaning 


56** 


69** 


52** 


76** 


51** 


70** 


52 


16 




05 


26 


35 


20 


62** 


64 


15 


2. Listening 


46* 


43* 


53* 


53** 


64** 


59 


13 






05 


40* 


47* 


53** 


69 


18 


3 . Matching 




62** 


63** 


69** 


85** 


49 


25 








50** 


-04 


56** 


85 


16 


4. Alphabet 






71** 


58** 


84** 


40 


27 










26 


82** 


45 


12 


5 . Numbers 








56** 


83** 


38 


17 












55** 


42 


19 


6. Copying 










79** 


60 


27 














62 


20 


7 . Percentile 












41 


27 



NOTE — Upper entry in each cell is Msn sample, lower entry is sample. N = 21 for each 
sample, one S_ in Bel not included because test 14 was not available. Decimals omit- 
ted. m is per cent correct; £ is the standard deviation. 

* X > .37, £< .05, one-tailed test. 

** X > .50, £< .01, one-tailed test. 



perform well and others poorly and (b) why is 
the one group of stimuli satisfactorily handled 
and the other not? In both instances, the an- 
swers will take the form of experimental ma- 
nipulations — training procedures or variations 



in materials which produce satisfactory per- 
formance. It is important to note that feed- 
back under these conditions is immediate — 
you quickly find out when a testing or train- 
ing procedure doesn't work. 



IX 

SUMMARY 



Even though the BST package is in a pre- 
liminary form, and despite the limited number 
of subjects, certain conclusions are worth 
noting. First, if skill components are narrowly 
defined, they appear remarkably independent. 
This independence is not the result of suppress- 
ing variation among individuals; from the bi- 
modal distribution of ^s in Rhyme-Production 
one might well expect high correlations with 
other tasks, and yet these are not found ex- 
cept for the Segmentation test. 

With regard to specific skill areas, it ap- 
pears that visual perception skills contribute 
only minimally to matching tests. Few errors 
are made in the matching of single letters, 
where perceptual problems should play the 
primary role. Instead, the problems which 
arise seem to be of a cognitive nature, such 



as order and memory for forms. The data sug- 
gest that sound matching, segmentation, and 
association of sounds and symbols are poorly 
developed skills in most kindergartners . 
However, testing procedures may be at fault 
— a possibility which must be evaluated more 
fully. 

Finally, two comments can be made about 
the relation of the BST data to reading achieve- 
ment. First, correlational data on this rela- 
tionship will be available in a year and the 
relation can be determined empirically. Sec- 
ond, inquiries about the relation of specific 
component skills to reading achievement may 
be irrelevant to our purpose because most 
achievement tests are not particularly sensi- 
tive to the decoding process on which BST 
package focuses. 
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